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GENE FAMILIES ASSOCIATED WITH CANCERS 

FBELP OF THE INVENTION 

The present invention relates to the changes in gene expression in human tissues 
5 from cancer patients. The invention specifically relates to human genes which are 
differentially expressed in cancer tissues of breast, colon, esophagus, kidney, liver, lung, 
lymph node, ovary, pancreas, prostate, rectum, and/or stomach compared to corresponding 
normal tissues. 

BACKGROUND OF THE INVENTION 

10 In the United States, more than one million new cancer cases are diagnosed and 

about half million people die of cancer. The causes of cancer are many and varied, and 
include genetic predisposition, environmental influences, infectious agents and ageing. 
These transform normal cells into cancerous ones by derailing a wide spectrum of 
regulatory and downstream effector pathways. Several essential alterations in cell 

15 physiology collectively dictate malignant growth: self-sufiBciency in growth signals, 
insensitivity to growth-inhibitory signals, evasion of programmed cell death, limitiess 
replicative potential, sustained angiogenesis, and tissue invasion and metastasis (Hanahan 
and Weinberg (2000), Cell 100:57-70). 

To date, researchers have been able to identify many genetic alterations believed to 
20 underlie tumor development. These genetic alterations include amplification of oncogenes 
and mutations that result m the loss of tumor suppressor genes. Oncogenes were initially 
identified as genes carried by vimses that cause transformation of their target cells. A 
major class of the viral oncogenes have cellular counterparts that are involved in normal 
cell functions. The cellular genes are called proto-oncogene, and in certain cases their 
25 mutation or aberrant in the cell is associated with tumor formation. The generation of a 
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oncogene represents a gain-of-&nction in which a cellular proto-oncogene is 
inappropriaiely activated. This can involve a mutational change in the protein, or 
constitutive activation, over-e3q)ression, or feUure to turn off expression at the appropriate 
time. About 100 oncogenes have been identified. Examples of oncogenes include, but are 
5 not limited to, ras. fos, myc, abl, and myb (Ponder (2001). Nature 411:336-341). Tumor 
suppressor genes, in their wild-type alleles, express proteins that suppress abnormal 
cellular proliferation. When the gene coding for a tumor suppressor protein is mutated or 
deleted, the resulting mutant protem or the complete lack of tumor suppressor protein 
expression may fail to correctly regulate cellular proliferation, and abnormal proUferation 
10 may take place, particularly if there is already existing damage to the cellular regulatory 
mechanism. A number of well-studied human tumors and tumor cell lines have missing or 
non-fimctional tumor suppressor genes. Examples of tumor suppressor genes include, but 
are not limited to, the retinoblastoma susceptibility gene or RB gene, the p53 gene, the 
deletion in colon carcinoma (DCC) gene and the neurofibromatosis type 1 (NF-1) tumor 
15 suppressor gene (Weinberg (1991). Science 254:1138-1146). Loss-of-fiinction or 
inactivation of tumor suppressor genes may play a central role in the initiation and/or 
progression of a significant number of human cancers. 

The utilization of genome-wide expression profiles to classify tumors, to identify 
drug targets, to identify diagnostic markers and/or to gain fiirther insights into the 

20 consequences of chemotherapeutic treatments could fecilitate the design of more 
efficacious stratagems for treating a variety of cancers. Initial studies utilizing gene 
expression patterns to identify subtypes of cancer produced rather intriguing results (see 
Peroue/a/. (\999\ Proc Natl Acad Sci USA 96:9212-9217; Golub etal. (1999), Science 
286:531-537; Alizadeh et al (2000), Nature 403:503-511; Alon et al (1999), Proc Natl 

25 AcadSci USA 96:6745-6750; and Bittner et al. (2000), Nature 406:536-540; Perou et al. 
(2000), Nature 406:747-752). Molecular classification of B-cell lymphoma by gene 
expression profiling elucidated clinically distinct diffuse large-B-cell lymphoma 
subgroups (see Alizadeh et al., supra). In breast cancer, studies utilizing limited numbers 
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of genes (8,102 genes) have classified tumors into subtypes based on gene expression 
profiles, and this study indicated a diversity of molecular phenotypes associated with 
breast tumors (see Perou et aL, supra). In addition, the expression profilmg has enabled 
researchers to map tissue-specific expression levels for thousands of genes (Alon et aL 

5 (1999), Proc Natl Acad Sci USA 96:6745-6750; Iyer et al (1999), Science 283:83-87; 
Khan et a/.(1998). Cancer Res 58:5009-5013; Lee et a/.(1999). Science 285:1390-1393; 
Wang et al. (1999), Gene 229:101-108; Whitney et al. (1999), Ann Neurol 46:425-428). 
Although these studies have demonstrated that expression profiling may be used to 
produce improvements in diagnosis of human diseases such as cancer, as well as in the 

1 0 development of improved therapeutic strategies, further studies are needed. 

Although cancers are diverse and heterogeneous as they are derived firom 
numerous tissues and multiple etiologic factors, it has been suggested that underlymg this 
variability lies a relatively small number of critical events whose convergence is required 
for the development of any and all cancers ^van and Vousden (2001), Nature 411:342- 

15 348). Accordingly, there exists a need for the comprehensive investigation of the changes 
in global gene expression levels in many different types of cancers to identify critical 
molecular markers associated with the development and progression of cancer. There 
remains a need in the art for materials and methods that permit a more accurate diagnosis 
of cancer. In addition, there remains a need in the art for methods to treat and methods to 

20 identify agents that can effectively treat this disease. The present invention meets these 
and other needs. 

SUMMARY OF TPTC TNVRNnrtON 

The present invention is based on new genes that are differentially expressed in 
cancer tissues compared to normal tissues, hereinafter LFGl, LFG2, LFG3, LFG4, LFG5, 
25 LFG6, respectively. The invention includes isolated nucleic acid molecules comprising 
SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 or the complement thereof. 
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The present invention further includes the nucleic acid molecules operably linked 
to one or more expression control elements, including vectors comprising the isolated 
nucleic acid molecules. The invention fiuther includes host cells transformed to contmn 
the nucleic acid molecules of the invention and methods for producing a protein 
5 comprising the step of culturing a host cell transformed with a nucleic acid molecule of 
the invention under conditions in which the protein is expressed. 

The invention further provides an isolated polypeptide selected from the group 
consisting of an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 
2, 4, 6, 8, 10, 12, 14 or 16, an isolated polypeptide comprising a fragment of at least 10 

10 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, an isolated polypeptide 
comprising conservative amino acid substitutions of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16 and an isolated polypeptide comprising naturally occurring amino acid sequence 
variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. Polypeptides of the invention also 
include polypeptides with an amino acid sequence having at least about 50%, 60%, 70% 

15 or 75% amino acid sequence identity with the sequence set forth in SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14 or 16, preferably at least about 80%, more preferably at least about 90-95%, and 
most preferably at least about 95-98% sequence identity with the sequence set forth in 
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 

The present invention further provides methods of identifying other members of 
20 the polypeptide family of the invention. Specifically, the nucleic acid sequence of SEQ ID 
NO: 1, 3, 5, 7, 9, 11, 13 or 15 can be used as a probe, or to generate PGR primers, in 
methods to identify nucleic acid molecules tfiat encode other members of the LFGl, LFG2, 
LFG3, LFG4, LFG5 or LFG6 family of proteins. 

The invention further provides an isolated antibody or antigen-binding antibody 
25 fragment that specifically binds to a polypeptide of the invention, including monoclonal 
and polyclonal antibodies. 
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The invention further provides methods of identifying an agent which modulates 
the expression of a nucleic acid molecule encoding a protein of the invention, comprising: 
exposing cells which express the nucleic acid molecule to the agent; and determining 
whether the agent modulates expression of said nucleic acid molecule, thereby identifying 
5 an agent which modulates the expression of a nucleic acid molecule encoding the protein. 

The invention fixrther provides methods of identifying an agent which modulates 
the level of or at least one activity of a protein of the invention, comprising: exposing cells 
which express the protein to the agent; and determining whether the agent modulates tiie 
level of or at least one activity of said protein, thereby identifying an agent which 
10 modulates the level of or at least one activity of the protein. 

The present invention further provides methods of modulating the expression of a 
nucleic acid molecule encoding a protein of the invention, comprising tiie step of 
administering an effective amount of an agent which modulates tiie expression of a nucleic 
acid molecule encoding the protein. The invention also provides methods of modulating 
15 at least one activity of a protein of the invention, comprising the step of administering an 
effective amount of an agent which modulates at least one activity of the protein of tiie 
invention. 

The invention furtfier provides metiiods of identifying binding partners for a 
protein of the invention, comprising tiie steps of exposing said protein to a potential 
20 binding partner; and determming if tiie potential binding partner binds to said protein, 
thereby identifying binding partners for the protem. 

The present mvention fiirther provides methods to identify agents tiiat can block or 
modulate the association of a protein of tiie invention witii a bmding partner. Specifically, 
an agent can be tested for the ability to block, reduce or otherwise modulate tiie 
25 association of a protein of invention witii a binding partner by contacting said protein, or a 
fragment tiiereof, and a binding partner witii a test agent and determining whether the test 
agent blocks or reduces the buiding of tiie protein of invention to the bin^g partner. 

5 
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The present invention further provides methods for reducing or blocking the 
association of a protein of invention with one or more of its binding partners, comprismg 
the step of administrating an effective amount of an agent which reduces or blocks the 
bmding of said protein to the binding partner. The method can utilize an agent that binds 
5 to the protein of mvention or to the binding partner. 

In accordance with another aspect of the invention, the proteins of the invention 
can be used as starting points for rational drug design to provide ligands, therapeutic drugs 
or other types of small chemical molecules. Alternatively, small molecules or other 
compounds identified by the above-described screening assays may serve as "lead 
10 compounds" in rational drug design. 

The present invention further relates to a process for treating cancer comprising 
inserting into a cancerous cell a nucleic acid construct comprising the nucleic acid 
molecules of the invention operably linked to a promoter or enhancer element such that 
expression of said nucleic acid molecule causes suppression of said cancer. 

15 The present invention further includes non-human transgenic animals modified to 

contam the nucleic acid molecules of the invention, or non-human transgenic animals 
modified to contain the mutated nucleic acid molecules such that e3q)ression of the 
encoded polypeptides of tiie invention is prevented. 

The present invention also includes non-human transgenic animals in which all or 
20 a portion of a gene comprising all or a portion of SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 
has been knocked out or deleted fix)m the genome of the animal. 

The invention further provides methods of diagnosing cancers, comprising the 
steps of acquiring a tissue, blood, urine or other sample firom a subject and determining the 
level of expression of a nucleic acid molecule of the invention or polypeptide of the 
25 invention. 
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The invention further includes compositions comprising a diluent and a 
polypeptide or protein selected ftom the group consisting of an isolated polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2. 4, 6, 8, 10, 12, 14 or 16, an 
isolated polypeptide comprising a fragment of at least 10 amino acids of SEQ ID NO: 2, 4, 

5 6, 8, 10, 12, 14 or 16, an isolated polypeptide comprising conservative amino acid 
substitutions of SEQ ID NO: 2, 4. 6, 8, 10. 12, 14 or 16. naturally occurring amino acid 
sequence variants of SEQ ID NO: 2, 4, 6, 8, 10. 12, 14 or 16 and an isolated polypeptide 
with an ammo acid sequence having at least about 50%, 60%, 70% or 75% amino acid 
sequence identity with the sequence set forth in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, 

10 preferably at least about 80%, more preferably at least about 90-95%. and most preferably 
at least about 95-98% sequence identity with the sequence set forth in SEQ ID NO: 2, 4, 6, 
8. 10, 12, 14 or 16. 

TOTCF PESCRIPTT ^T^ TUTC DRAWINGS 

Figure 1 shows tiie relative alignment positions of the two LFGl clones. 

15 Figure 2 is a hydrophobicity plot of the protein encoded by the open reading frame 

of LFGl-Clone A (SEQ ID NO: 2). Analysis was performed according to the method of 
Kyte-DoolitUe. 

Figure 3 is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFGl-Clone B (SEQ ID NO: 4). Analysis was performed according to the method of 
20 Kyte-Doolittle. 

Figure 4 is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFG2 (SEQ ID NO: 6). Analysis was performed according to the method of Kyte- 
Doolittle. 



7 



wo 2004/035789 




CT/KR2003/002161 



Figure 5 is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFG3 (SEQ ID NO: 8). Analysis was performed according to the method of Kyte- 
Doolittle. 

Figure 6 is a hydrophobicity plot of the protein encoded by the open reading frame 
5 of LFG4 (SEQ ID NO: 10). Analysis was performed according to the method of Kyte- 
Dooiittle. 

Figure 7 is a hydrophobicity plot of the protein encoded by the open reading frame 
of ALFG5 (SEQ ID NO: 12). Analysis was performed according to the method of Kyte- 
Doolittle. 

10 Figure 8 shows the relative alignment positions of the two LFG6 clones. 

Figure 9 is a hydrophobicity plot of the protem encoded by the open reading frame 
of LFG6-#20 (SEQ ID NO: 14). Analysis was performed according to tfie method of 
Kyte-Doolittle. 

Figure 10 is a hydrophobicity plot of the protein encoded by the open reading 
15 frame of LFG6-#46 (SEQ ID NO: 16). Analysis was performed according to the method 
of Kyte-Doolittle. 

DETAn.RTI DRSCRTPTIGN CkV TTTF. PREFERRED EMBODIMENT 

/. General Description 

The present invention is based in part on the identification of new gene families 
20 that are differentially expressed in cancerous human tissues compared to normal human 
tissues. These gene families correspond to the human cDNA of SEQ ID NOS: 1, 3, 5, 7, 9, 
11, 13 and 15. 
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The genes and proteins of the invention may be used as diagnostic agents or 
markers to detect cancer or to differentiate carcinoma from normal tissue in a sample. 
They can also serve as a target for agents that modulate gene expression or activity- For 
example, agents may be identified that modulate biological processes associated with 
5 tumor growth, including the hyperplastic process of cancer. 

n. Specific Embodiments 

A. The Proteins Associated with Cancer 

The present invention provides isolated protems, allelic variants of the protems, 
and conservative amino acid substitutions of the proteins. As used herein, tiie **protein" or 

10 "polypeptide" refers, in part, to a protein that has the hiunan amino acid sequence depicted 
in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. The terms also refer to naturally occurring 
allelic variants and proteins that have a slightly different amino acid sequence than that 
specifically recited above. Allelic variants, though possessing a slightly different amino 
acid sequence than those recited above, will still have the same or similar biological 

1 5 functions associated with these proteins. 

As used herein, the family of protems related to the human amino acid sequence of 
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 refers to protems that have been isolated from 
organisms in addition to humans. The methods used to identify and isolate other members 
of the family of proteins related to these proteins are described below. 

20 The proteins of the present invention are preferably in isolated form. As used 

herein, a protein is said to be isolated when physical, mechanical or chemical methods are 
employed to remove the protein from cellular constituents that are normally associated 
with the protein. A skilled artisan can readily employ standard purification methods to 
obtain an isolated protem. 
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Hie proteins of the present invention fiirdier include insertion, deletion or 
conservative amino acid substitution variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 
As used herein, a conservative variant refers to alterations in the amino acid sequence that 
do not adversely affect the biological fiinctions of the protein. A substitution, insertion or 

5 deletion is said to adversely affect the protein when the altered sequence prevents or 
disrupts a biological function associated with the protein. For example, the overall charge, 
structure or hydrophobic/hydrophilic properties of the protein, in certain instances, may be 
altered wiUiout adversely affecting a biolo^cal activity. Accordmgly, tiie amino acid 
sequence can be altered, for example to render the peptide more hydrophobic or 

10 hydrophilic, without adversely affecting tiie biological activities of tiie protein. 

Ordinarily, the allelic variants, the conservative substitution variants, and the 
members of the protein family, will have an amino acid sequraice having at least about 
50%, 60%, 70% or 75% amino acid sequence identity with the sequence set forth in SEQ 
ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, more preforably at least about 80%, even more 

15 preferably at least about 90-95%, and most preferably at least about 95-98% sequence 
identity. Identity or homology with respect to such sequences is defined herein as the 
percentage of amino acid residues in the candidate sequence that are identical with SEQ 
ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent homology, and not considering any 

20 conservative substitutions as part of the sequence identity (see section B for the relevant 
parameters). Fusion proteins, or N-terminal, C-terminal or internal extensions, deletions, 
or insertions into the peptide sequence shall not be construed as affecting homology. 

Hius, the proteins of the present invention include molecules having the amino 
acid sequence disclosed in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16; fiagments thereof 
25 having a consecutive sequaice of at least about 3, 4, 5, 6, 10, 15, 20, 25, 30, 35 or more 
amino acid residues of these proteins; ammo acid sequence variants wherein one or more 
amino acid residues has been inserted N- or C-termmal to, or within, the disclosed coding 
sequence; and amino acid sequence variants of the disclosed sequence, or their firagments 

10 
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as defined above, that have been substituted by at least one residue. Such fragments, also 
referred to as peptides or polypeptides, may contain antigenic regions, fimctional regions 
of the protein identified as regions of the amino acid sequence which correspond to known 
protein domains, as well as regions of pronounced hydrophilicity. The regions are all 
5 easily identifiable by using commonly available protein sequence analysis software such 
as MacVector (Oxford Molecular). 

Contemplated variants further include those containing predetermined mutations 
by, eg., homologous recombination, site-directed or PGR mutagenesis, and the 
corresponding protems of other animal species, including but not limited to rabbit, mouse, 
10 rat, porcine, bovine, ovine, equine and non-human primate species, and the alleles or other 
naturally occurring variants of the femUy of proteins; and derivatives wherein the protein 
has been covalently modified by substitution, chemical, enzymatic, or other appropriate 
means with a moiety other than a naturally occurring amino acid (for example a detectable 
moiety sudi as an enzyme or radioisotope). 

15 The present invention further provides compositions comprising a protem or 

polypeptide of the invention and a diluent. Suitable diluents can be aqueous or non- 
aqueous solvents or a combmation thereof, and can comprise additional components, for 
example water-soluble salts or glycerol, that contribute to the stability, solubility, activity, 
and/or storage of the protein or polypeptide. 

20 As described below, members of the families of proteins can be used: (1) to 

identify agents which modulate the level of or at least one activity of the protem, (2) to 
identify binding partners for the protein, (3) as an antigen to raise polyclonal or 
monoclonal antibodies, (4) as a therapeutic agent or target and (5) as a diagnostic agent or 
marker of cancer. 

25 B. Nucleic Acid Molecules 
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The present invention further provides nucleic acid molecules that encode the 
protein having SEQ ID NO: 2. 4. 6, 8. 10. 12, 14 or 16 and the related proteins herein 
described, preferably in isolated form. As used herein, "nucleic acid" is defined as RNA 
or DNA that encodes a protein or peptide as defined above, is complementary to a nucleic 
5 acid sequence encoding such peptides, hybridizes to the nucleic acid of SEQ ID NO: 1, 3, 
5, 7, 9, 1 1, 13 or 15 and remains stably bound to it under appropriate stringency conditions, 
encodes a polypeptide sharing at least about 50%, 60%. 70% or 75%. preferably at least 
about 80%, more preferably at least about 90-95%. and most preferably at least about 95- 
98% or more identity with the peptide sequence of SEQ ID NO: 2, 4. 6, 8. 10, 12. 14 or 16 
10 or exhibits at least 50%. 60%, 70% or 75%, preferably at least about 80%, more preferably 
at least about 90-95%, and most preferably at least about 95-98% or more nucleotide 
sequence identity over the open reading frames of SEQ ID NO: 1, 3, 5, 7. 9, 1 1, 13 or 15. 

The present invention further includes isolated nucleic acid molecules that 
specifically hybridize to the complement of SEQ ID NO: 1. 3, 5, 7, 9. 11, 13 or 15, 
15 particularly molecules that specifically hybridize over the open reading frames. Such 
molecules that specifically hybridize to the complement of SEQ ID NO: 1, 3, 5. 7, 9. 1 1, 
13 or 15 typically do so under stringent hybridization conditions. 

SpecificaUy contemplated are genomic DNA, cDNA. mRNA and antisense 
molecules, as well as nucleic acids based on alternative backbones or including alternative 
bases, whether derived from natural sources or synthesized. Such hybridizing or 
complementary nucleic acids, however, are defined fijrther as being novel and unobvious 
over any prior art nucleic acid including that which encodes, hybridizes under appropriate 
stringency conditions, or is complementary to nucleic acid encoding a protein according to 
the present invention. 

Homology or identity at tiie nucleotide or amino acid sequence level is determined 
by BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed 
by the programs blastp. blastn. blastx. tblastn and tblastx (Altschul et al (1997), Nucleic 



20 
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Acids Res. 25: 3389-3402, and Karlin et al. (1990), Proc. Natl Acad. Sci. USA 87: 2264- 
2268, both fully incorporated by reference) which are tailored for sequence similarity 
searching. The approach used by the BLAST program is to first consider similar segments, 
with and without gaps, between a query sequence and a database sequence, then to 

S evaluate the statistical significance of all matches that are identified and finally to 
summarize only those matches which satisfy a preselected threshold of significance. For a 
discussion of basic issues in similarity searching of sequence databases, see Altschul et al. 
(1994), Nat. Genet. 6: 119-129 which is fully incorporated by reference. The search 
parameters for histogram, descriptions, alignments, expect (i.e., the statistical significance 

10 threshold for reporting matches against database sequences), cutoff, matrix and filter (low 
complexity) are at the default settings. The default scoring matrix used by blastp, blastx, 
tblastn, and tblastx is the BLOSUM62 matrix (Henikoff al. (1992), Proc, Natl Acad. 
Sci. USA 89: 10915-10919, fiilly incorporated by reference), recommended for query 
sequences over 85 nucleotides or amino acids in length. 

15 For blastn, the scoring matrix is set by the ratios of M (z.e., the reward score for a 

pair of matchmg residues) to N (i.e., the penalty score for mismatching residues), wherem 
the default values for M and N are 5 and -4, respectively. Four blastn parameters were 
adjusted as follows: Q=10 (gap creation penalty); R=10 (gap extension penalty); winkr=l 
(generates word hits at every wink* position along the query); and gapw=16 (sets the 

20 window width within which gapped alignments are generated). The equivalent Blastp 
parameter settings were Q=9; R=2; winkr=l; and gapw=32. A Bestfit comparison between 
sequences, available in the GCG package version 10.0, uses DNA parameters GAP=50 
(gap creation penalty) and LEN=3 (gap extension penalty) and the equivalent settings in 
protem comparisons are GAP=8 and LEN=2. 

25 "Stringent conditions" are those that (1) employ low ionic strength and high 

temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium citrate/0.1% SDS 
at 50^C, or (2) employ during hybridization a denaturing agent such as formamide, for 
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example, 50% (vol/vol) formamide with 0.1% bovine seram albumiii/0.1% Ficoll/0.1% 
polyvinyIpyrrolidone/50 mM sodium phosphate buffer at pH 6,5 with 750 mM NaCl, 75 
mM sodium citrate at 42^C. Another example is hybridization m 50% formamide, 5x 
SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% 

5 sodium pyrophosphate, 5x Denhardt's solution, sonicated sahnon sperm DNA (50 |ig/ml), 
0.1% SDS, and 10% dextran sulfate at 42°C, with washes at 42^C in 0.2x SSC and 0.1% 
SDS. A skilled artisan can readily determine and vary the stringency conditions 
appropriately to obtain a clear and detectable hybridization signal. Preferred molecules 
are those that hybridize under the above conditions to the complement of SEQ ID NO: 1, 3, 

10 5, 7, 9, 11, 13 or 15 and which encode a functional or full-length protein. Even more 
preferred hybridizing molecules are those that hybridize under the above conditions to the 
complement strand of the open reading fiame of SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15. 

As used herein, a nucleic acid molecule is said to be "isolated'' when the nucleic 
acid molecule is substantially separated from contaminant nucleic acid molecules 
1 5 encoding other polypeptides. 

The present invention further provides fragments of the disclosed nucleic acid 
molecules. As used herem, a fragment of a nucleic acid molecule refers to a small portion 
of the coding or non-coding sequence. The size of the fragment will be determined by the 
intended use. For example, if the fragment is chosen so as to encode an active portion of 
20 the protein, the fragment will need to be large enough to encode the fimctional region(s) of 
the protein. For instance, fragments which encode peptides corresponding to predicted 
antigenic regions may be prepared. If the fragment is to be used as a nucleic acid probe or 
PGR primer, then the fragment length is chosen so as to obtain a relatively small number 
of false positives during probing/priming (see the discussion in Section G). 

25 Fragments of the nucleic acid molecules of the present invention (/,e., synthetic 

oligonucleotides) that are used as probes or specific primers for the polymerase chain 
reaction (PGR), or to synthesize gene sequences encoding proteins of the invention, can 
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easily be synthesized by chemical techniques, for example, the phosphoramidite method 
of Matteucci et a/-, ((1981) J. Am. Chem. Soc. 103: 3185-3191) or usmg automated 
synthesis methods. In addition, larger DNA segments can readily be prepared by well 
known methods, such as synthesis of a group of oligonucleotides that define various 
5 modular segments of the gene, followed by ligation of oligonucleotides to build the 
complete modified gene. 

The nucleic acid molecules of the present invention may further be modified so as 
to contain a detectable label for diagnostic and probe purposes. A variety of such labels 
are known in the art and can readily be employed with tiie encoding molecules herem 
10 described. Suitable labels include, but ate not limited to, biotin, radiolabeled or 
fluorescently labeled nucleotides and the like. A skilled artisan can readily employ any 
such label to obtain labeled variants of the nucleic acid molecules of the invention. 

C. Isolation of Other Related Nucleic Acid Molecules 

As described above, the identification and characterization of the nucleic acid 
15 molecule having SEQ ID NO: 1, 3, 5, 7, 9, U, 13 or 15 allows a skilled artisan to isolate 
nucleic acid molecules that encode other members of the protein family in addition to the 
sequences herein described. Further, the presently disclosed nucleic acid molecules allow 
a skilled artisan to isolate nucleic acid molecules that encode other members of the family 
of proteins in addition to the proteins havmg SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 

20 For instance, a skilled artisan can readily use the amino acid sequence of SEQ ID 

NO: 2, 4, 6, 8, 10, 12, 14 or 16 to generate antibody probes to screen expression libraries 
prepared from appropriate cells. Typically, polyclonal antiserum from mammals such as 
rabbits immunized with the purified protein (as described below) or monoclonal 
antibodies can be used to probe a mammalian cDNA or genomic expression library, such 

25 as lambda gtU library, to obtain the appropriate coding sequence for other members of the 
protein family. The cloned cDNA sequence can be expressed as a fiision protein, 
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expressed directly using its own control sequences, or expressed by constructions using 
control sequences appropriate to the particular host used for expression of the enzyme. 

Alternatively, a portion of the coding sequence herein described can be 
synthesized and used as a probe to retrieve DNA encoding a member of the protein family 
5 from any mammalian organism. Oligomers containing approximately 18-20 nucleotides 
(encoding about a 6-7 amino acid stretch) are prepared and used to screen genomic DNA 
or cDNA libraries to obtain hybridization under stringent conditions or conditions of 
sufficient stringency to eliminate an undue level of false positives. 

Additionally, pairs of oligonucleotide primers can be prepared for use in PGR to 
10 selectively clone an encodmg nucleic acid molecule. A PGR denature/anneal/extend cycle 
for using such PGR primers is well known in the art and can readily be adapted for use in 
isolating other encoding nucleic acid molecules. 

Nucleic acid molecules encoding other members of tilie protem family may also be 
identified in existing genomic or other sequence information using any available 
15 computational method, including but not Ihnited to: PSI-BLAST (Altschul et al (1997), 
Nucl Acids Res. 25: 3389-3402); PHI-BLAST (Zhang et al (1998), Nucl Acids Res. 26: 
3986-3990), 3D-PSSM (Kelly et al (2000), J. Mol Biol 299: 499-520); and other 
computational analysis methods (Shi etal (1999), Biochem. Biophys. Res. Commun. 262: 
132-138 and Matsunami et. al (2000), Nature 404: 601-604). 

20 D. rDNA molecules Containing a Nucleic Acid Molecule 

The present invention further provides recombinant DNA molecules (rDNAs) that 
contam a codmg sequence. As used herein, a rDNA molecule is a DNA molecule that has 
been subjected to molecular manipulation in situ. Methods for generating rDNA 
molecules are well known m the art, for example, see Sambrook et al. Molecular 
25 Cloning- A Lahotatorv Manual. Third Ed.. Gold Spring Harbor Laboratory Press, Gold 
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Spring Harbor, NY, 2001. In the preferred rDNA molecules, a coding DNA sequence is 
operably linked to ejcpression control sequences and/or vector sequences. 

The choice of vector and/or expression control sequences to which one of the 
protein family encoding sequences of the present invention is operably linked depends 
5 directly, as is well known in the art, on the functional properties desired, e.g., protein 
expression, and the host cell to be transformed. A vector contemplated by the present 
invention is at least capable of directing the replication or insertion mto the host 
chromosome, and preferably also expression, of the structural gene included in the rDNA 
molecule. 

10 Expression control elements that are used for regulating the expression of an 

operably linked protein encoding sequence are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, and other 
regulatory elements. Preferably, the inducible promoter is readily controlled, such as 
being responsive to a nutrient in the host cell's medium. 

15 In one embodiment, the vector containing a codmg nucleic acid molecule will 

include a prokaryotic replicon, i.e., a DNA sequence having the ability to du^ct 
autonomous replication and maintenance of the recombinant DNA molecule 
extrachromosomally in a prokaryotic host cell, such as a bacterial host cell, transformed 
therewith. Such replicons are well known m the art In addition, vectors that mclude a 

20 prokaryotic replicon may also include a gene whose expression confers a detectable 
marker such as a drug resistance. Typical bacterial drug resistance genes are tiiose that 
confer resistance to ampicillm, kanamycin, chloramphenicol or tetracycline. 

Vectors that hiclude a prokaryotic replicon can further include a prokaryotic or 
bacteriophage promoter capable of directing the expression (transcription and translation) 
25 of the coding gene sequences in a bacterial host cell, such as R coli. A promoter is an 
expression control element formed by a DNA sequence that permits binding of RNA 
polymerase and transcription to occur. Promoter sequences compatible witii bacterial 
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hosts are typically provided in plasmid vectors containing convenient restriction sites for 
insertion of a DNA segment of the present invention. Typical of such vector plasmids are 
pUC8, pUC9, pBR322 and pBR329 available from BioRad Laboratories, (Richmond, CA), 
pPL and pKtC223 available from Pharmacia (Piscataway, NJ). 

5 Expression vectors compatible with eukaryotic cells, preferably those compatible 

with vertebrate cells, can also be used to form rDNA molecules that contain a coding 
sequence. Eukaryotic cell expression vectors, including viral vectors, are well known in 
the art and are available from several commercial sources. Typically, such vectors are 
provided containing convenient restriction sites for insertion of the desired DNA segment. 

10 Typical of such vectors are pSVL and pKSV-10 (Pharmacia), pBPV-l/pML2d 
(Intemational Biotechnologies, Inc.), pTDTl (ATCC, #31255), the vector pCDM8 
described herein, and the like eukaryotic expression vectors. Vectors may be modified to 
include tissue specific promoters if needed. 

Eukaryotic cell expression vectors used to construct the rDNA molecules of the present 
15 invention may further include a selectable marker that is effective in an eukaryotic cell, 
preferably a drug resistance selection marker. A preferred drug resistance marker is the 
gene whose expression results in neomycin resistance, i.e., the neomycin 
phosphotransferase (ned) gene. (Southern et al (1982), J. Mol Anal Genet 1:327-341). 
Alternatively, the selectable marker can be present on a separate plasmid, and the two 
20 vectors are introduced by co-transfection of the host cell, and selected by culturing in the 
appropriate drug for the selectable marker. 

E. Host Cells Containing an Exogenously Supplied Coding Nucleic Acid Molecule 

The present invention further provides host cells transformed with a nucleic acid 
molecule that encodes a protein of the present invention. The host cell can be either 
25 prokaryotic or eukaryotic. Eukaryotic cells useful for expression of a protein of the 
invention are not limited, so long as the cell line is compatible with cell culture methods 
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and compatible with the propagation of the expression vector and expression of the gene 
product Preferred eukaryotic host cells include, but are not limited to, yeast, msect and 
mammalian cells, preferably vertebrate cells such as those from a mouse, rat, monkey or 
human cell line. Preferred eukaryotic host cells include Chinese hamster ovary (CHO) 
5 cells available from the ATCC as CCL61, NIH Swiss mouse embryo cells (NIH/3T3) 
available from the ATCC as CRL 1658, baby hamster kidney cells (BHK), and the like 
eukaryotic tissue culture cell lines. 

Any prokaryotic host can be used to express a rDNA molecule encoding a protein 
of the invention. The preferred prokaryotic host is E. colu 

10 Transformation of appropriate cell hosts with a rDNA molecule of the present 

invention is accomplished by well known methods that typically depend on the type of 
vector used and host system employed. With regard to transformation of prokaryotic host 
cells, electroporation and salt treatment methods are typically employed (see, for example, 
Cohen et al (1972), Proc, Natl Acad. Set USA 69: 2110; and Sambrook et al, supra). 

15 With regard to transformation of vertebrate cells with vectors containing rDNAs, 
electroporation, cationic lipid or salt treatment methods are typically employed, see, for 
example, Graham et al (1973), Virol 52: 456; Wigler et al (1979), Proc. Natl Acad. Set 
USA 76: 1373-1376. 

Successfully transformed cells, le.^ cells that contam a rDNA molecule of the 
20 present invention, can be identified by well known techniques including the selection for a 
selectable marker. For example, cells resulting from the introduction of an rDNA of the 
present invention can be cloned to produce single colonies. Cells from those colonies can 
be harvested, lysed and thek DNA content examined for the presence of the rDNA using a 
method such as that described by Southern, (1975) J. Mol Biol 98: 503 or Berent et al, 
25 (1985) Biotech, 3: 208, or the proteins produced from the cell assayed via an 
immunological method. 
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F. Production of Recombinant Proteins using a rDNA Molecule 

The present invention further provides methods for producing a protein of the 
invention using nucleic acid molecules herein described. In general terms, the production 
of a recombinant form of a protein typically involves the following steps: 

5 First, a nucleic acid molecule is obtamed that encodes a protein of Ihe invention, 

such as a nucleic acid molecule comprismg, consisting essentially of or consisting of SEQ 
ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, or nucleotides 390-4883 or 390-4880 of SEQ ID NO: 1, 
or nucleotides 12-4907 or 12-4904 of SEQ ID NO: 3, or nucleotides 424-1911 or 424- 
1908 of SEQ ID NO: 5, or nucleotides 405-1838 or 405-1835 of SEQ ID NO: 7, or 

10 nucleotides 89-1 153 or 89-1150 of SEQ ID NO: 9, or nucleotides 223-1572 or 223-1569 
of SEQ ID NO: 11, or 418-1395 or 418-1392 of SEQ ID NO: 13, or nucleotides 271-1434 
or 271-1431 of SEQ ID NO: 15. If the encoding sequence is unmtemipted by mtrons, as 
are these open-readmg-frames, it is directly suitable for expression in any host. 

The nucleic acid molecule is then preferably placed in operable linkage with 
15 suitable control sequences, as described above, to form an expression unit containing the 
protein open readmg frame. The expression unit is used to transform a suitable host and 
the transformed host is cultured under conditions that allow the production of the 
recombinant protem. Optionally tiie recombinant protein is isolated from tiie medium or 
from tiie cells; recovery and purification of the protem may not be necessary m some 
20 instances where some impurities may be tolerated. 

Each of the foregouig steps can be done in a variety of ways. For example, the 
desired codmg sequences may be obtained from genomic fragments and used directiy in 
appropriate hosts. The construction of expression vectors tiiat are operable in a variety of 
hosts is accomplished using appropriate replicons and control sequences, as set fortii 
25 above. The control sequences, expression vectors, and transformation methods are 
dependent on the type of host cell used to express the gene and were discussed m detail 
earlier. Suitable restriction sites can, if not normally available, be added to tiie ends of tiie 
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coding sequence so as to provide an excisable gene to insert into these vectors. A skilled 
artisan can readily adapt any host/expression system known in the art for use with the 
nucleic acid molecules of the invention to produce recombinant protein. 

G. Methods to Identify Agents that Modulate the Expression of a Nucleic Acid 
5 Encoding the Genes Associated with Cancer 

Another embodiment of the present invention provides methods for identifying 
agents that modulate the expression of a nucleic acid encoding a protein of the invention 
such as a protein having the amino acid sequsaice of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16. Such assays may utilize any available means of monitoring for changes in the 
10 expression level of the nucleic acids of the invention. As used herem, an agent is said to 
modulate the expression of a nucleic acid of the invention if it is capable of up- or down- 
regulating expression of the nucleic acid in a cell. 

In one assay format, cell lines that contain reporter gene fusions between 
nucleotides from within the open reading frame defined by nucleotides 390-4883 of SEQ 

15 ID NO: 1, nucleotides 12-4907 of SEQ ID NO: 3, nucleotides 424-1911 of SEQ ID NO: 5, 
nucleotides 405-1838 of SEQ ID NO: 7, nucleotides 89-1153 of SEQ ID NO: 9, 
nucleotides 223-1572 of SEQ ID NO: 11, nucleotides 418-1395 of SEQ ID NO: 13, 
nucleotides 271-1434 of SEQ ID NO: 15, and/or tiie 5' and/or 3' regulatory elements and 
any assayable fusion partner may be prepared. Numerous assayable fusion partners are 

20 known and readily available mcluduig the firefly luciferase g«ie and tiie gene encodmg 
chloramphenicol acetyltransferase (Alam et al (1990), Anal Biochem. 188: 245-254). 
Cell Imes containmg tibie reporter gene fusions are then exposed to tiie agent to be tested 
under appropriate conditions and time. Differential expression of tiie reporter gene 
between samples exposed to tiie agent and control samples identifies agents v^ich 

25 modulate the expression of a nucleic acid of the mvention. 
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> Additional assay fonnats may be used to monitor the ability of the agent to 
modulate the e:q>ression of a nucleic acid encoding a protein of the mvention, such as the 
protein having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. For instance, mKNA expression 
may be monitored directly by hybridization to the nucleic acids of the invention. Cell 
5 lines are exposed to the agent to be tested under appropriate conditions and time and total 
RNA or mRNA is isolated by standard procedures such those disclosed in Sambrook et alj 
Molecular Clonme - A Laboratorv Manual. Third Ed.. Cold Sprmg Harbor Laboratory 
Press, Cold Spring Harbor, NY, 2001 . 

The preferred cells will be those derived from human tissue, for instance, biopsy 
10 tissue or cultured cells from patients with cancer. Cell lines such as ATCC breast ductal 
carcinoma cell lines (Catalogue Nos. CRL-2320, CRL-2338, and CRL-7345), ATCC 
colorectal adenocarcinoma cell lines (Catalogue Nos. CCL-222, CCL-224, CCL-225, 
CCL-234, CRL-7159, and CRL-7184), ATCC kidney clear cell carcinoma cell lines 
(Catalogue Nos. HTB-46 and HTB-47), ATCC renal cell adenocarcinoma cell lines 
15 (Catalogue Nos. CRL-1611, CRL-1932 and CRL-1933), ATCC liver hepatocellular 
carcinoma cell Imes (Catalogue Nos. CRL-2233, CRL-2234, and HB-8065), ATCC lung 
adenocarcinoma cell lines (Catalogue Nos. CRL-5944, CRL-7380, and CRL-5907), 
ATCC lymphoma cell lines (Catalogue Nos. CRL-7936, CRL-7264, and CRL-7507), 
ATCC ovary adenocarcinoma cell lines (Catalogue Nos. HTB-161, HTB-75, and HTB-76), 
20 ATCC pancreas adenocarcinoma cell lines (Catalogue Nos. CRL-1687, CRL-2119, and 
HTP-79), prostate adenocarinoma cell lines (Catalogue Nos. CKL-1435, CRL-2422, and 
CRL-2220), and ATCC gastric adenocarcinoma cell lines (Catalogue Nos. CRL-1739, 
CRL-1863, and CRL-1864) may be used. Altematively, other available cells or cell lines 
may be used. 

25 Probes to detect differences in RNA expression levels between cells exposed to the 

agent and control cells may be prepared from the nucleic acids of the invention. It is 
preferable, but not necessary, to design probes which hybridize only with target nucleic 
acids under conditions of high stringency. Only highly complementary nucleic acid 

22 



wo 2004/035789 



'CT/KR2003/002161 



hybrids form under conditions of high stringency. Accordingly, the stringency of the 
assay conditions determines the amount of complementarity which should exist between 
two nucleic acid strands in order to form a hybrid. Stringency should be chosen to 
maximize the difference in stability between the probettarget hybrid and probetnon-target 
5 hybrids. 

Probes may be designed from the nucleic acids of the invention through methods 
known in the art. For instance, the G+C content of the probe and the probe length can 
affect probe binding to its target sequence. Methods to optimize probe specificity are 
commonly available in Sambrook et al, stpra^ or Ausubel et aly Short Protocols in 
10 Molecular Biology. Fourth Ed.. John Wiley & Sons, Inc., New York, 1999. 

Hybridization conditions are modified using known methods, such as those 
described by Sambrook et al and Ausubel et al as required for each probe. Hybridization 
of total cellular RNA or RNA enriched for polyA RNA can be accomplished in any 
available format. For instance, total cellular RNA or RNA enriched for polyA RNA can 

15 be affixed to a solid support and the solid support exposed to at least one probe 
comprising at least one, or part of one of the sequences of the invention under conditions 
in which the probe will specifically hybridize. Alternatively, nucleic acid fragments 
comprising at least one, or part of one of the sequences of the invention can be affixed to a 
solid support, such as a silicon chip, porous glass wafer or membrane. The solid support 

20 can then be exposed to total cellular RNA or polyA RNA from a sample under conditions 
in which the affixed sequences will specifically hybridize. Such solid supports and 
hybridization methods are widely available, for example, those disclosed by Beattie, 
(1995) WO 95/11755. By examining for the ability of a given probe to specifically 
hybridize to an RNA sample from an untreated cell population and from a cell population 

25 exposed to the agent, agents which up- or down-regulate the expression of a nucleic acid 
encoding the protein having the sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 are 
identified. 
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Hybridization for qualitative and quantitative analysis of mRNAs may also be 
carried out by using a RNase Protection Assay (i.e., RPA, see Ma et al. (1996), Methods 
10: 273-238). Briefly, an expression vehicle comprising cDNA encoding the gene product 
and a ph£^e specific DNA dependent RNA polymerase promoter (e.g., T7, T3 or SP6 

5 RNA polymerase) is linearized at the 3* end of the cDNA molecule, downstream from the 
phage promoter, wherem such a linearized molecule is subsequently used as a template for 
synthesis of a labeled antisense transcript of the cDNA by in vitro transcription. The 
labeled transcript is then hybridized to a mixture of isolated RNA (i.e., total or 
fractionated mRNA) by incubation at AS^'C overnight in a buffer comprising 80% 

10 formamide, 40 mM Pipes, pH 6.4, 0.4 M NaCl and 1 mM EDTA. The resulting hybrids 
are then digested in a buffer comprising 40 |ig/ml ribonuclease A and 2 \ig/ml 
ribonuclease. After deactivation and extraction of extraneous proteins, the samples are 
loaded onto urea/polyacrylamide gels for analysis. 

In another assay, to identify agents which affect the expression of the instant gene 
15 products, cells or cell lines are first identified which express the gene products of the 
invention physiologically. Cells and/or cell lines so identified would be expected to 
comprise the necessary cellular machinery such that the fidelity of modulation of the 
transcriptional apparatus is maintained with regard to exogenous contact of agent with 
appropriate surface transduction mechanisms and/or the cytosolic cascades. Further, such 
20 cells or cell lines would be transduced or transfected with an expression vehicle (e.g., a 
plasmid or viral vector) construct comprising an operable non-translated 5* promoter- 
containing end of the structural gene encoding the instant gene products fused to one or 
more antigenic fragments, which are peculiar to the instant gene products, wherein said 
fragments are under the transcriptional control of said promoter and are expressed as 
25 polypeptides whose molecular weight can be distinguished from the naturally occurring 
polypeptides or may further comprise an immunologically distinct tag or other detectable 
marker. Such a process is well known in the art (see Sambrook et aL, supra). 
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Cells or cell lines transduced or transfected as outlined above are then contacted 
with agents under £5)propriate conditions. For example, the agent in a pharmaceutically 
acceptable excipient is contacted witii cells in an aqueous physiological buffer such as 
phosphate buffered saline OPBS) at physiological pH, Eagles balanced salt solution QBSS) 

5 at physiological pH, PBS or BSS comprismg serum or conditioned media comprismg PBS 
or BSS and/or serum incubated at 37^C. Said conditions may be modulated as deemed 
necessary by one of skill in the art. Subsequent to contacting the cells witii the agent, said 
cells will be disrupted and the polypeptides of the lysate are fractionated such that a 
polypeptide fraction is pooled and contacted with an antibody to be fiirther processed by 

10 immunological assay (e.g., ELISA, immunoprecipitation or Western blot). The pool of 
proteins isolated from the "agent-contacted" sample will be compared with a control 
sample where only the excipient is contacted with the cells and an increase or decrease in 
the immunologically generated signal from the "agent-contacted" sample compared to the 
control will be used to distinguish the effectiveness of the agent. 

15 H. Methods to Identify Agents that Modulate the Level or at Least One Activity of 
the Cancer Associated Proteins 

Another embodiment of the present invention provides methods for identifymg 
agents that modulate the level or at least one activity of a protein of the invention such as 
the protein having the ammo acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16- 
20 Such methods or assays may utilize any means of monitoring or detecting the desired 
activity and are particularly useful for identifying agents that treat cancer. 

In one format, the relative amounts of a protein of the invention between a cell 
population that has been exposed to the agent to be tested compared to an un-exposed 
control cell population may be assayed. In this format, probes such as specific antibodies 
25 are used to monitor the differential expression of the protein m the different cell 
populations. Cell lines or populations are exposed to the agent to be tested under 
fi^propriate conditions and tune. Cellular lysates may be prepared from the exposed cell 
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line or population and a control, unexposed cell line or population. The cellular lysates 
are then analyzed with the probe. 

Antibody probes are prepared by immunizing suitable mammalian hosts in 
appropriate immunization protocols using the peptides, polypeptides or proteins of the 

5 mvention if they are of sufficient length, or, if desired, or if required to enhance 
immunogenicity, conjugated to suitable carriers. Methods for preparing immunogenic 
conjugates with carriers such as BSA, KLH, or other carrier proteins are well known in the 
art. In some cuxumstances, dhect conjugation using, for example, carbodiimide reagents 
may be effective; in other mstances linkmg reagents such as those supplied by Pierce 

10 Chemical Co. (Etockfotd, EL), may be desuable to provide accessibility to the hapten. The 
hapten peptides can be extended at either the ammo or carboxy terminus with a cysteine 
residue or interspersed with cysteme residues, for example, to facilitate linking to a carrier. 
Administration of the immunogens is conducted generally by mjection over a suitable 
tune period and with use of suitable adjuvants, as is generally understood in the art. 

15 During the unmunization schedule, titers of antibodies are taken to determine adequacy of 
antibody formation. 

While the polyclonal antisera produced in this way may be satisfactory for some 
applications, for pharmaceutical compositions, use of monoclonal preparations is preferred. 
Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared 

20 using the standard method of Kohler and Milstein ((1975) Nature 256: 495-497) or 
modifications which effect inmiortalization of lymphocytes or spleen cells, as is generally 
known. The immortalized cell Imes secreting the desired antibodies are screened by 
unmunoassay m which the antigen is the peptide hapten, polypeptide or protein. When 
the appropriate immortalized cell culture secreting the desired antibody is identified, the 

25 cells can be cultured either in vitro or by production in ascites fluid. 

The destfed monoclonal antibodies are then recovered from the culture supernatant 
or from die ascites supernatant. Fragments of tiie monoclonal antibodies or tiie polyclonal 
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antiseia which contain the immunologically significant (antigen-binding) portion can be 
used as antagonists, as well as the intact antibodies. Use of immunologically reactive 
(antigen-binding) antibody fragments, such as the Fab, Fab*, or F(ab')2 fragments is often 
preferable, especially in a ther^eutic context, as these fragments are generally less 
5 immunogenic than the whole immunoglobulin. 

The antibodies or antigen-binding fragments may also be produced, using current 
technology, by recombinant means. Antibody regions that bind specifically to the desired 
regions of the protein can also be produced in the context of chimeras with multiple 
species origin, such as humanized antibodies. 

1 0 Agents that are assayed m the above method can be randomly selected or rationally 

selected or designed. As used herein, an agent is said to be randomly selected when the 
agent is chosen randomly without considering the specific sequences mvolved in the 
association of a protein of the mvention alone or with its associated substrates, bmding 
partners, etc. An example of randomly selected agents is the use a chemical library or a 

1 5 peptide combinatorial library, or a growth broth of an organism. 

As used herein, an agent is said to be rationally selected or desig?ied when the 
agent is chosen on a nonrandom basis which takes mto account tiie sequence of the target 
site and/or its conformation in connection with tiie agent's action. Agents can be 
rationally selected or rationally designed by utilizing tiie peptide sequences tiiat make up 
20 these sites. For example, a rationally selected peptide agent can be a peptide whose amino 
acid sequence is identical to or a derivative of any fimctional consensus site. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. Dominant negative proteins, DNAs 
encodmg these proteins, antibodies to these proteins, peptide fragments of these protems 
25 or mimics of these protems may be mtroduced mto cells to affect fimction. "Mimic" used 
herein refers to tiie modification of a region or several regions of a peptide molecule to 
provide a structure chemically different from the parent peptide but topographically and 
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fiuictionally similar to the parent peptide (see Grant in: Molecular Biology and 
Biotechnolopv> Movers, ed., pp. 659-664, VCH Publishers, Inc., New York, 1995). A 
skilled artisan can readily recognize that there is no limit as to the structural nature of the 
agents of the present invention. 

5 The peptide agents of the invention can be prepared using standard solid phase (or 

solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantly using standard recombinant 
production systems. The production using solid phase peptide synthesis is necessitated if 

10 non-gene-encoded amino acids are to be included. 

Another class of agents of the present invention are antibodies inununoreactive 
with critical positions of proteins of the invention, e.g., cytoplasmic domain, spacer 
domain, a-helical coiled-coil domain, or the receptor domain, as described herein. 
Antibody agents are obtained by immunization of suitable manmialian subjects with 
15 peptides, containing as antigenic regions, those portions of the protein intended to be 
targeted by the antibodies. 

L Uses for Agents that Modulate the Expression or at Least one Activity of the 
Proteins Associated with Cancer 

As provided in the Examples, the proteins and nucleic acids of the invention, such 
20 as the protems havmg the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, 
are differentially expressed in cancerous tissue. Agents that up- or down- regulate or 
modulate the expression of the protein or at least one activity of the protein, such as 
agonists or antagonists, may be used to modulate biological and pathologic processes 
associated with the protem's function and activity. This mcludes agents identified 
25 employing homologues and analogues of the present invention. 
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As used herein, a subject can be any mammal, so long as the mammal is in need of 
modulation of a pathological or biological process mediated by a protein of the invention. 
The term "mammal" is defined as an individual belonging to the class Mammalia. The 
invention is particularly useful in the treatment of human subjects. 

5 Pathological processes refer to a category of biological processes which produce a 

deleterious effect. For example, expression of a protein of the invention may be 
associated with cell growth or hyperplasia. As used herein, an agent is said to modulate a 
pathological process when the agent reduces the degree or severity of the process. For 
instance, cancer may be prevented or disease progression modulated by the administration 

10 of agents which up- or down-regulate or modulate in some way the expression or at least 
one activity of a protem of the invention. 

The agents of the present invention can be provided alone, or in combination with 
other agents that modulate a particular pathological process. For example, an agent of the 
present invention can be administered in combination with other known drugs. As used 
15 herein, two agents are said to be administered in combination when the two agents are 
administered simultaneously or are administered independently in a fashion such that the 
agents will act at the same time. 

The agents of the present invention can be administered via parenteral, 
subcutaneous, intravenous, intramuscular, intraperitoneal, transdermal, or buccal routes. 
20 Alternatively, or concurrently, administration may be by the oral route. The dosage 
administered will be dependent upon the age, health, and weight of the recipient, kind of 
concurrent treatment, if any, firequency of treatment, and the nature of the effect desired. 

The present invention further provides compositions contsuning one or more agents 
which modulate expression or at least one activity of a protein of the invention. While 
25 individual needs vary, determination of optimal ranges of efifective amounts of each 
component is witiiin the skill of the art. Typical dosages comprise 0.1 to 100 fig/kg body 
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wt. The preferred dosages comprise 0,1 to 10 ng/kg body wt The most preferred dosages 
comprise 0.1 to 1 )J.g/kg body wt 

In addition to the pharmacologically active agent, the compositions of the present 
invention may contain suitable pharmaceutically acceptable carriers comprising excipients 

5 and auxiliaries which facilitate processmg of the active compounds into preparations 
which can be used pharmaceutically for delivery to the site of action. Suitable 
formulations for parenteral administration include aqueous solutions of the active 
compounds in water-soluble form, for example, water-soluble salts. In addition, 
suspensions of tiie active compounds as appropriate oily injection suspensions may be 

10 admmistered. Suitable lipophilic solvents or vehicles include fatty oils, for example, 
sesame oil, or synthetic fetty acid esters, for example, ethyl oleate or triglycerides. 
Aqueous injection suspensions may contain substances which increase the viscosity of the 
suspension include, for example, sodium carboxymethyl cellulose, sorbitol, and/or dextran. 
Optionally, the suspension may also contain stabilizers. Liposomes can also be used to 

1 5 encapsulate the agent for delivery mto the cell. 

The pharmaceutical formulation for systemic administration accordmg to the 
invraition may be formulated for enteral, parenteral or topical administration. Indeed, all 
three types of formulations may be used simultaneously to achieve systemic 
administration of the active ingredient 

20 Suitable formulations for oral administration include hard or soft gelatin capsules, 

pills, tablets, including coated tablets, elbdrs, suspensions, syrups or inhalations and 
controlled release forms thereof. 

In practicing the metiiods of tiiis invention, the compounds of this invention may 
be used alone or in combination, or in combination with other therapeutic or diagnostic 
25 agents. In certam preferred embodiments, the compovmds of this invention may be 
coadministered along witii otiier compounds typically prescribed for these conditions 
according to generally accepted medical practice. The compounds of this invention can be 
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Utilized in vivo, ordinarily in mammals, such as humans, sheep, horses, cattle, pigs, dogs, 
cats, rats and mice, or in vitro. 

J. Methods to Identify Binding Partners 

Another embodiment of the present invention provides methods for isolating and 
5 identifying binding partners of proteins of the invention. In general, a protein of the 
invention is mixed with a potential binding partner or an extract or fraction of a cell under 
conditions that allow the association of potential binding partners with the protein of the 
invention. After mixing, peptides, polypeptides, proteins or other molecules that have 
become associated witii a protein of the invention are separated from the mixture. The 
10 bmdmg partner that bound to the protein of the invention can then be removed and fiirther 
analyzed. To identify and isolate a binding partner, the entire protein, for mstance a 
protein comprising the entire amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16 can be used. Alternatively, a fragment of the protein can be used. 

As used herem, a cellular extract refers to a preparation or fraction which is made 
15 from a lysed or disrupted cell. The preferred source of ceUular extracts will be cells 
dffl-ived from human tumors or transformed cells, for instance, biopsy tissue or tissue 
culture cells from carcinomas. Alternatively, cellular extracts may be prepared from 
normal tissue or available cell lines. 

A variety of methods can be used to obtain an extract of a cell. Cells can be 
20 disrupted usmg either physical or chemical disruption methods. Examples of physical 
disruption methods include, but are not limited to, sonication and mechanical shearing. 
Examples of chemical lysis metiiods include, but are not limited to, detergent lysis and 
enzyme lysis. A skilled artisan can readily adapt methods for preparing ceUular extracts in 
order to obtain extracts for use in the present methods. 

25 Once an extract of a cell is prepared, the extract is mfaced with the protein of the 

invention under conditions in which association of tiie protein with the binding partner can 
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occur. A variety of conditions can be used, tlie most preferred being conditions that 
closely resemble conditions found in the cytoplasm of a human cell. Features such as 
osmolarity, pH, temperature, and the concentration of cellular extract used, can be varied 
to optimize the association of the protein with the binding partner. 

5 After mixing under appropriate conditions, the bound complex is separated from 

the mixture. A variety oftechniques can be utilized to separate the mixture. For example, 
antibodies specific to a protein of the invention can be used to immunoprecipitate the 
binding partner complex. Altematively, standard chemical separation techniques such as 
chromatography and density/sediment centrifugation can be used. 

10 After removal of non-associated cellular constituents found in the extract, tiie 

b'mding partner can be dissociated from the complex using conventional metiiods. For 
example, dissociation can be accomplished by altering the salt concentration or pH of the 
mixture. 

To aid in separating associated binding partner pairs from the mixed extract, the 
15 protein of the mvention can be immobilized on a solid support For example, the protein 
can be attached to a nitrocellulose matrix or acrylic beads. Attachment of the protein to a 
solid support aids in separating peptideA)inding partner pairs from other constituents found 
in the extract. The identified binding partners can be either a single protein or a complex 
made up of two or more proteins. Altematively, binding partners may be identified usmg 
20 a Far-Western assay according to the procedures of Takayama et al (1 997), Methods Mol 
Biol. 69: 171-184 or Sander et al. (1996), J. Gen. Virol 77: 991-996 or identified through 
the use of epitope tagged proteins or GST fusion proteins. 

Altematively, tiie nucleic acid molecules of tiie invention can be used in a yeast 
two-hybrid system or other in vivo protein-protein detection system. The yeast two-hybrid 
25 system has been used to identify other protein partner pairs and can readily be adapted to 
employ the nucleic acid molecules herein described. 
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K. Use of the Binding Partners of the Cancer Associated Proteins 

Once isolated, the binding partners of the proteins of the invention, and 
homologues and analogues thereof, obtained using the above described methods can be 
used for a variety of purposes. The binding partners can be used to generate antibodies 
5 that bind to the binding partner using techniques known in the art. Antibodies that bind the 
bmding partner can be used to assay the activity of the protein of the invention, as a 
therapeutic agent to modulate a biological or pathological process mediated by the protein 
of the invention, or to purify the bindmg partner. These uses are described in detail below. 

L. Methods to Identify Agents that Bloclt the Associations tietween the Binding 
1 0 Partners and the Cancer Associated Proteins 

Another embodiment of the present invention provides methods for identifymg 
agents that reduce or block the association of a protein of the invention with a binding 
partner. Specifically, a protein of the invention is mixed with a binding partner in the 
presence and absence of an agent to be tested. After mixing under conditions that allow 
15 association of the proteins, the two mbrtures are analyzed and compared to determine if 
the agent reduced or blocked the association of the protein of the invention with the 
binding partner. Agents that block or reduce the association of the protein of the invention 
with the binding partner will be identified as decreasing the amount of association present 
in the sample containing the tested agent. 

20 As used herein, an agent is said to reduce or block the association between a 

protein of the invention and a binding partner when the presence of the agent decreases the 
extent to which or prevents the binding partner from becoming associated with the protein 
of the invention. One class of agents will reduce or block the association by binding to the 
binding partner while another class of agents will reduce or block the association by 

25 binding to tiie protein of the invention. 
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The binding pcurtner used in tiie above assay can either be an isolated and fiiUy 
characterized protein or can be a partially characterized protein that binds to the protein of 
the invention or a binding partner that has been identified as being present in a cellular 
extract* It will be apparent to one of ordinary skill in the art that so long as the binding 
S partner has been characterized by an identifiable property, e.g., molecular weight, the 
present assay can be used. 

Agents that are assayed in the above method can be randomly selected or rationally 
selected or designed. As used herein, an agent is said to be randomly selected when the 
agent is chosen randomly without considering the specific sequences involved in the 
10 association of the protein of the invention with the binding partner. An example of 
randomly selected agents is the use of a chemical library or a peptide combinatorial library, 
or a growth broth of an organism. 

As used herein, an agent is said to be rationally selected or designed when the 
agent is chosen on a nonrandom basis which takes into account the sequence of the target 

15 site and/or its conformation in connection with the agenf s action. Agents can be rationally 
selected or rationally designed by utilizing the peptide sequences that make up the contact 
sites of the binding partner with the protein of the invention. For example, a rationally 
selected peptide agent can be a peptide whose amino acid sequence is identical to the 
contact site of the protein of the invention on the binding partner. Such an agent will 

20 reduce or block the association of the protein of the invention with the binding partus by 
binding to the binding partner. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. A skilled artisan can readily recognize that 
there is no limit as to the structural nature of the agents of the present invention. 

25 One class of agents of tiie present invention are peptide agents whose amino acid 

sequences are chosen based on the amino acid sequence of the protein of the invention. 
The peptide agents of the invention can be prepared usmg standard solid phase (or 
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solution phase) peptide synthesis metiiods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantiy using standard recombinant 
production systems. The production using solid phase peptide syntiiesis is necessitated if 
5 mn-geos encoded amino acids are to be included. 

Another class of agents of the present invention are antibodies immunoreactive 
with critical positions of the protein of the invention or the binding partner. As described 
above, antibodies are obtained by immunization of suitable mammalian subjects witii 
peptides, containing as antigenic regions, tiiose portions of tiie protein of the invention or 
10 tiie binding partner, intended to be targeted by tiie antibodies. Critical regions include tiie 
contact sites mvolved in tiie association of tiie protein of tiie invention witii tiie binding 
partner. 

As discussed below, the important minimal sequence of residues mvolved in 
activity of the protein of the invention define a functional linear domain that can be 
15 effectively used as a bait for two hybrid screening and identification of potential 
associated molecules. Use of such fragments will significantly increase the specificity of 
tiie screening as opposed to using tiie fuU-lengtii molecule and is tiierefore preferred. 
Similarly, this linear sequence can be also used as an alBBnity matrix also to isolate binding 
proteins using a biochemical affinity purification strategy. 

20 M. Uses for Agents that Block the Associations between the Binding Partners and 
tiie Cancer Associated Proteins 

As provided in the Examples, the proteins and nucleic acids of the invention, sudi 
as tiie proteins having the amino acid sequence of SEQ ED NO: 2, 4, 6, 8, 10, 12, 14 or 16, 
are differentially expressed in cancerous tissue. Agents that reduce or block the 
25 interactions of a protein of tiie invention, including those identified employmg 
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homologues and analogues of the protein, with a bmding partner may be used to modulate 
biological and pathologic processes associated with the protein's function and activity. 

As used herein, a subject can be any mammal, so long as the mammal is in need of 
modulation of a pathological or biological process mediated by a protein of the mvention. 
5 The term "mammal" is meant an individual belonging to the class Mammalia. The 
invention is particularly useful in the treatment of human subjects. 

Pathological processes refer to a category of biological processes which produce a 
deleterious effect. For example, expression of a protem of the invention may be 
associated with cell growth or hyperplasia. As used herein, an agent is said to modulate a 
10 pathological process when the agent reduces the degree or severity of the process. For 
instance, cancer may be prevented or disease progression modulated by the administration 
of agents that reduce or block the interactions of a protein of the invention with a binding 
partner. 

The agents of the present invention can be administered via parenteral, 
15 subcutaneous, intravenous, intramuscular, intraperitoneal, transdermal, or buccal routes. 
Alternatively, or concurrently, administration may be by the oral route. The dosage 
administered will be dependent upon the age, health, and weight of the recipi^t, Idnd of 
concurrent treatment, if any, frequency of treatment, and the nature of the effect desired. 

The present invention further provides compositions containing one or more agents 
20 that block association of a protein of the invention with a binding partner. While 
individual needs vary, determination of optimal ranges of effective amoxmts of each 
component is within the skill of the art. Typical dosages comprise 0.1 to 100 jig/kg body 
wt. The preferred dosages comprise 0.1 to 10 jig/kg body wt. The most preferred dosages 
comprise 0.1 to 1 ^g/kg body wt. 

25 In addition to the pharmacologically active agent, the compositions of the present 

invention may contain suitable pharmaceutically acceptable carriers comprising excipients 



36 



wo 2004/035789 




PCT/KR2003/002161 



and auxUiaries which fecilitate processing of the active compounds into preparations 
which can be used pharmaceutically for delivery to the site of action. Suitable 
formulations for parenteral administration include aqueous solutions of - the active 
compounds in water soluble form, for example, water soluble salts. In addition, 

5 suspensions of tiie active compounds as appropriate oily injection suspensions may be 
administered. Suitable lipophilic solvents or vehicles include fatty oils, for example, 
sesame oil, or synthetic fatty acid esters, for example, ethyl oleate or triglycerides. 
Aqueous injection suspensions may contam substances which increase the viscosity of the 
suspension include, for example, sodium carboxymetiiyl cellulose, sorbitol, and/or dextran. 

10 Optionally, the suspension may also contain stabilizers. Liposomes can also be used to 
encapsulate the agent for delivery into the cell. 

The pharmaceutical formulation for systemic administration according to the 
invention may be formulated for enteral, parenteral or topical administration. Indeed, all 
three types of formulations may be used simultaneously to achieve systemic 
1 5 admmistration of tiie active ingredient 

Suitable formulations for oral administration include hard or soft gelatin capsules, 
pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and 
controlled release forms thereof. 

In practicing the methods of this invention, the compounds of this invention may 
20 be used alone or in combination, or in combmation whh other therapeutic or diagnostic 
agents. In certain preferred embodiments, tiie compounds of tiiis invention may be 
coadmmistered along witii other compounds typically prescribed for tiiese conditions 
according to generally accepted medical practice. The compounds of tiiis invention can be 
utilized in vivo, ordinarily in manmials, such as humans, sheep, horses, cattle, pigs, dogs, 
25 cats, rats and mice, or in vitro. 

N. Rational Drug Design and Combinatorial Chemistry 
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The present invention further encompasses rational drug design and combinatorial 
chemistry. Those of skill will recognize appropriate methods to utilize and exploit aspects 
of the present invention in identifying compounds which can be developed for cancer 
treatment Rational drug design involving polypeptides requires identifying and defining 

5 a first peptide with which the designed drug is to interact, and using the first target peptide 
to define the requurements for a second peptide. With such requirements defined, one can 
find or prepare an appropriate peptide or non-peptide that meets all or substantially all of 
the defined requirements. Thus, one goal of rational drug design is to produce structural or 
functional analogs of biologically active polypeptides of interest or of small molecules 

10 with which they interact (e.g., agonists, antagonists, null compounds) in order to fashion 
drugs that are, for example, more or less potent forms of the ligand. (See, e.g., Hodgson 
(1991), Bio. Technology 9:19-21). Combinatorial chemistry is the science of synthesizing 
and testing compounds for bioactivity en masse, instead of one by one, the aim being to 
discover drugs and materials more quickly and inexpensively than was formerly possible. 

15 Rational drug design and combinatorial chemistry have become more intimately related in 
recent years due to the development of approaches in computer-aided protein modeling 
and drug discovery. (See e.g., US Pat. No. 4,908,773; 5,884,230; 5,873,052; 5,331,573; 
and 5,888,738). 

The use of molecular modeling as a tool for rational drug design and combinatorial 
20 chemistry has dramatically increased due to the advent of computer graphics. Not only is 
it possible to view molecules on computer screens in three dimensions but it is also 
possible to examine the interactions of macromolecules such as enzymes and receptors 
and rationally designed derivative molecules to test. (See Boorman (1992), Chem. Eng. 
News 70:18-26). A vast amount of user-fiiendly software and hardware is now available 
25 and virtually all pharmaceutical companies have computer modeling groups devoted to 
rational drug design. Molecular Simulations Inc. (www.msi.com), for example, sells 
several sophisticated programs that allow a user to start from an amino acid sequence, 
build a two or three-dimensional model of the protein or polypeptide, compare it to other 
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two and tiiree-dimensional models, and analyze the interactions of compounds, drugs, and 
peptides witii a three dimensional model in real time. Accordingly, in some embodiments 
of the Invention, software is used to compare re^ons of tiie invention protein and 
molecules that interact therewith (collectively referred to as "bmding partners" -e.g., anti- 

5 protein antibodies), and firagments or derivatives of tiiese molecules with other molecules, 
such as peptides, peptidomimetics, and chemicals, so that therapeutic interactions can be 
predicted and designed. (See Schneider (1998), Genetic Engineering News December: 
page 20; Tempczyk et ah (1997), Molecular Simulations Inc. Solutions April; and 
Butenhof (1998), Molecular Simulations Inc. Case Notes (August 1998) for a discussion 

10 of molecular modeling). 

O. Gene Therapy 

In another embodiment, genetic therapy can be used as a means for modulating 
biolo^cal and pathologic processes associated wilh the protein's ftinction and activity. 
This comprises inserting into a cancerous cell a gene construct encoding a protein 

15 comprising all or at least a portion of the sequences of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 
or 16, or alternatively a gene construct comprismg all or a portion of the non-«oding 
region of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, operably linked to a promoter or 
enhancer element such that expression of said protein causes suppression of said cancer 
and wherein said promoter or enhancer element is a promoter or «ihan(%r elonent 

20 modulating said gene construct. 

In the constructs described, expression of said protein can be dkected from any 
suitable promoter (e.g., the human cytomegalovirus (CMV), simian virus 40 (SV40), or 
metallotiiionein promoters), and regulated by any appropriate mammalian regulatory 
element. For example, if desired, enhancers known to pref^ntially direct gene expression 
25 in neural cells, T cells, or B cells may be used to dkect the expression. The enhancers used 
could mclude, -without limitation, tiiose that are characterized as tissue or cell specific in 
their expression. Alternatively, if a genomic clone of LFGl, LFG2, LFG3, LFG4, LFG5 
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or LFG6 is used as a therapeutic construct (for example, following its isolation by 
hybridization with the nucleic acid molecule of the invention described above), regulation 
may be mediated by the cognate regulatory sequences or, if desired, by regulatory 
sequences derived from a heterologous source, including any of the promoters or 
5 regulatory elements described above. 

Insertion of the construct into a cancerous cell is accomplished in vivo, for 
example using a viral or plasmid vector. Such methods can also be applied to in vitro uses. 
Thus, the methods of the present invention are readily applicable to different forms of 
gene therapy, either where cells are genetically modified ex vivo and then administered to 
10 a host or where the gene modification is conducted in vivo using any of a number of 
suitable methods involving vectors especially suitable to such therapies. 

Retroviral vectors, adenoviral vectors, adeno-associated viral vectors, or other viral 
vectors with the appropriate tropism for cells likely to be mvolved in cancer (for example, 
epithelial cells) may be used as a gene transfer delivery system for a therapeutic gene 
15 construct. Numerous vectors useful for this purpose are generally known (Cozzi PJ, et al., 

(2002) Prostate, 53(2):95-100; Bitzer M, Lauer U., (2002) Dtsch Med Wochenschr. 

127(31-32):1623-1624; Mezzina and Danos (2002), Trends Genet. 8:241-256; Loser et al. 

(2002) Curr. Gene Ther. 2:161-171; Pfeifer and Verma (2001), Rev. Genomics Hum. 

Genet. 2:177-211). Retroviral vectors are particularly well developed and have been used 
20 in clinical settings (Anderson et al. (1995), U.S. Patent No. 5,399,346). Non-viral 

approaches may also be employed for the mtroduction of therapeutic DNA into cells 

otherwise predicted to undergo cancer (Jeschke et al. (20002) Curr. Gene Ther. 1:267- 

278; Wu et al. (1988), J. Biol Chem. 263:14621-14624; Wu et al. (1989), J. Biol. Chem. 

264:16985-16987). For example, a gene may be mtroduced into a neuron or a T cell by 
25 lipofection, asialorosonucoid polylysine conjugation, or, less preferably, microinjection 

under surgical conditions. 
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For any of the methods of application described above, the therapeutic nucleic acid 
construct is preferably applied to the site of the cancer event (for example, by injection). 
However, it may also be applied to tissue in the vicinity of the cancer event or to a blood 
vessel supplying tiie cells predicted to undergo cancer. 

5 P. Transgenic Animals 

Transgenic animals containing mutant, knock-out or modified genes corresponding 
to the cDNA sequence of SEQ ID NO: 1. 3, 5, 7, 9, 11, 13 or 15, or the open reading 
ftame encodmg the polypeptide sequence of SEQ ID NO; 2, 4, 6, 8, 10, 12, 14 or 16, or 
fragmraits tiiereof having a consecutive sequence of at least about 3, 4, 5, 6, 10, 15, 20, 25, 

10 30, 35 or more amino acid residues, are also included in the invention. Transgenic 
animals are geneticaUy modified animals into which recombinant, exogenous or cloned 
genetic material has been experunentally transferred. Such genetic material is often 
referred to as a 'transgene." The nucleic acid sequence of the transgene, in this case a 
form of SEQ ID NO: 1, 3, 5. 7, 9. 11, 13 or 15, may be integrated either at a locus of a 

1 5 genome where tiiat particular nucleic acid sequence is not otherwise normally found or at 
the normal locus for the transgene. The transgene may consist of nucleic.acid sequences 
derived from the genome of the same species or of a different species than the species of 
the target animal. 

In some embodiments, transgemc animals in which all or a portion of a gene 
20 comprising SEQ ID NO: 1, 3, 5, 7, 9, 1 1. 13 or 15 is deleted may be constructed. In those 
cases where the gene corresponding to SEQ ID NO: 1. 3, 5, 7, 9, 1 1, 13 or 15 contains one 
or more inttons, the entire gene- all exons, introns and the regulatory sequences- may be 
deleted. Alternatively, less than the entire gene may be deleted. For example, a single 
exon and/or intron may be deleted, so as to create an animal expressing a modified version 
25 of a protein of the invention. 
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The tenn "germ cell line transgenic animal" refers to a transgenic animal in which 
tiie genetic alteration or genetic information was mtroduced into a germ line cell, thereby 
conferring tiie ability of the transgenic animal to transfer the genetic mformation to 
offspring. If such offepring in feet possess some or aU of that alteration or genetic 
5 information, tiien they too are transgenic animals. 

The alteration or genetic information may be foreign to tiie species of animal to 
which the recipient belongs, foreign only to tiie particular individual recipient, or may be 
genetic information akeady possessed by the recipient. In tiie last case, the altered or 
introduced gene may be e:iq>ressed differentiy than the native gene. 

10 Transgenic animals can be produced by a variety of different methods including 

transfection, electroporation. microinjection, gene targeting in embryonic stem cells and 
recombinant viral and retroviral infection (see. e.g., U.S. Patent No. 4,736,866; U.S. 
Patent No. 5,602,307; Mullins et al. (1993). Hypertension 22: 630-633; Brenin et al. 
(1997), Surg. Oncol. 6: 99-110; Recombinant Gene Expre ssion Protocols (Methods in 

15 Molecular Binlofv. Vol. 62\ Tuan, ed., Humana Press, Totowa, NJ, 1997). 

A number of recombinant or transgenic mice have been produced, mduding tiiose 
which express an activated oncogene sequence (U.S. Patent No. 4,736,866); express 
simian SV40 T-antigen (U.S. Patent No. 5,728,915); lack tiie expression of interferon 
regulatory factor 1 (ERF-l) (U.S. Patent No. 5,731,490); exhibit dopaminergic dysfunction 

20 (U.S. Patent No, 5,723,719); express at least one human gene which participates in blood 
pressure control (U.S. Patent No. 5.731,489); display greater similarity to tiie conditions 
existing in naturally occurrmg Alzheimer's disease (U.S. Patent No. 5,720,936); have a 
reduced capacity to mediate cellular adhesion (U.S. Patent No. 5,602,307); possess a 
bovine growtii hormone gene (Clutter et al. (1996), Genetics 143: 1753-1760); or, are 

25 capable of generating a fully human antibody response (McCartiiy (1997), Lancet 349: 
405). 
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While mice and rats remain the animals of choice for most transgenic 
e3q)erimentation, in some instances it is preferable or even necessary to use alternative 
animal species. Transgenic procedures have been successfully utilized in a variety of non- 
murine animals, including sheep, goats, pigs, dogs, cats, monkeys, chimpanzees, hamsters, 
5 rabbits, cows and guinea pigs (see, e.g., Kim et al (1997), Mol Reprod. Dev. 46: 515-526; 
Houdebine (1995), Reprod. Nutr. Dev. 35: 609-617; Fetters (1994), Reprod. Fertil Dev. 6: 
643-645; Schnieke et al. (1997), Science 278: 2130-2133; and Amoah (1997), J. Animal 
Sci. 75: 578-585). 

The method of introduction of nucleic acid fragments into recombination 
10 competent mammalian cells can be by any method which favors co-transformation of 
multiple nucleic acid molecules. Detailed procedures for producing transgenic animals 
are readily available to one skilled in the art, including the disclosures in U.S. Patent No. 
5,489,743 and U.S. Patent No. 5,602,307. 

Q. Diagnostic Methods 

15 As the genes and proteins of the invention are diflferentially expressed in cancerous 

tissues compared to non-cancerous tissues, the genes and proteins of the invention may be 
used to diagnose or monitor cancer, to track disease progression, or to differentiate 
cancerous tissue from non-cancerous tissue samples. One means of diagnosing cancer 
usmg the nucleic acid molecules or proteins of the invention involves obtmning tissue 

20 from living subjects. 

Assays to detect nucleic acid or protein molecules of the invention may be in any 
available format. Typical assays for nucleic acid molecules include hybridization or PGR 
based formats. Typical assays for the detection of proteins, polypeptides or peptides of 
the invention include the use of antibody probes in any available format such as in situ 
25 binding assays, etc. (see Harlow & Lane, Antibodies - A Laboratorv Manual, Cold Spring 
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Harbor Laboratory Press, Cold Spring Harbor, NY, 1988). In preferred embodiments, 
assays are carried-out with appropriate controls. 

Generally, the diagnostics of the invention can be classified according to whether 
the embodiment is a nucleic acid or protein-based assay. Some diagnostic assays detect 

5 mutations or polymorphisms in the invention nucleic acids or proteins, which contribute to 
cancerous aberrations. Other diagnostic assays identify and distinguish defects in protein 
activity by detecting a level of invention RNA or protein in a tested organism that 
resembles the level of invention RNA or protein in a organism suffering from a disease, 
such as cancer, or by detecting a level of RNA or protein in a tested organism that is 

1 0 different than an organism not suffering from a disease. 

Additionally, the manufacture of kits that mcorporate the reagents and methods 
described in the following embodiments so as to allow for the rapid detection and 
identification of aberrations in protein activity or level are contemplated. The diagnostic 
kits can include a nucleic acid probe or an antibody or combinations thereof, which 

15 specifically detect a mutant form of the invention protein or a nucleic acid probe or an 
antibody or combinations thereof, which can be used to determine the level of RNA or 
protein expression of one or more invention protein. The detection component of these 
kits will typically be supplied in combination with one or more of the following reagents. 
A support capable of absorbing or otfierwise binding DNA, RNA, or protein will often be 

20 supplied. Available supports include membranes of nitrocellulose, nylon or derivatized 
nylon that can be characterized by bearing an array of positively charged substituents. One 
or more restriction enzymes, control reagents, buffers, amplification enzymes, and non- 
human polynucleotides like calf-thymus or salmon-sperm DNA can be supplied in these 
kits. 

25 Usefiil nucleic acid-based diagnostic techniques include, but are not limited to, 

direct DNA sequencing, gradient gel electrophoresis, Southern Blot analysis, single- 
stranded confirmation analysis (SSCA), RNAse protection assay, dot blot analysis, nucleic 
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acid amplification, allele-specific PGR and combinations of these approaches. The starting 
point for these analyses is isolated or piirified nucleic acid from a biological sample. It is 
contemplated that tissue biopsies would provide a ^od sample source. The nucleic acid 
is extracted from the sample and can be amplified by a DNA amplification technique such 

5 as the Polymerase Chain Reaction (PGR) using primers. Those of skill in the art will 
readily recognize methods available for confirming the presence of polymorphisms. In 
addition, any addressable array tedmology known in the art can be employed with this 
aspect of the invention. One particular embodhnent of polynucleotide arrays is known as 
Genechips™, and has been generally described in US Patent 5,143,854; PGT publications 

10 WO 90/15070 and 92/10092. 

A wide variety of labels and conjugation techniques are known by those skilled m 
the art and can be used in various nucleic acid assays. There are several ways to produce 
labeled nucleic acids for hybridization or PGR including, but not limited to, oligolabelmg, 
nick translation, end-labeling, or PGR amplification using a labeled nucleotide. 

15 Alternatively, a nucleic acid encoding an invaition protein can be cloned into a vector for 
the production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and can be used to sfynthesize RNA probes in vitro by addition of an appropriate 
RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. A number of companies 
such as Pharmacia Biotech (Piscataway, NJ), Promega (Madison, WI), and U.S. 

20 Biochemical Corp (Cleveland, OH) supply commercial kits and protocols for these 
procedures. Suitable reporter molecules or labels include those radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as well as, substrates, cofactors, 
inhibitors, magnetic particles and the like. 

In preferred protein-based diagnostic, antibodies of the mvention are attached to a 
25 support in an ordered array wherein a plurality of antibodies are attached to distinct 
regions of the support fliat do not overlap with each other. Those of skill m the art will 
readily recognize available assays that are protein-based diagnostics. Proteins are 
obtamed from biological samples and are labeled by conventional approaches (e.g., 
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radioactivity, colorimetrically, or fluorescently). Employing labeled standards of a known 
concentration of mutant and/or wild-type invention protein, an investigator can accurately 
determine the concentration of the invention protein in a sample and from this information 
can assess the expression level of the particular form of the protein. Conventional methods 

5 in densitometry can also be used to more accurately determine the concentration or 
expression level of such protein. These approaches are also easily automated using 
technology known to tiiose of skill in the art of high throughput diagnostic analysis. As 
detailed above, any addressable array technology known in the art can be employed with 
this aspect of the invention and display the protein arrays on the chips m an attempt to 

10 maximize antibody binding patterns and diagnostic information. 

As discussed above, the presrace or detection of a polymorphism in an invention 
gene or protein can provide a diagnosis of a cancer or similar malady in an organism. 
Additional embodiments include the preparation of diagnostic kits comprising detection 
components, such as antibodies, specific for a particular polymorphic variant of invention 

15 gene or protem. The detection component will typically be supplied in combination with 
one or more of the following reagents. A support capable of absorbmg or otherwise 
binding RNA or protein will often be supplied. Available supports for this purpose include, 
but are not limited to, membranes of nitrocellulose, nylon or derivatized nylon that can be 
characterized by bearing an array of positively charged substituents, and Genechips™ or 

20 their equivalents. One or more enzymes, such as Reverse Transcriptase and/or Taq 
polymerase, can be ftimished in the kit, as can dNTPs, buffers, or non-human 
polynucleotides like calf-thymus or saUnon-sperm DNA. Results from the kit assays can 
be interpreted by a healthcare provider or a diagnostic laboratory. Alternatively, diagnostic 
kits are manufactured and sold to private individuals for self-diagnosis. 

25 In addition to diagnosing disease according to the presence or absence of a 

polymorphism, some diseases involving cancer result from skewed levels of invention 
protein or gene in particular tissues or aberrant patterns of invention protein expression. 
By monitoruag the level of expression in various tissues, for example, a diagnosis can be 
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made or a disease state can be identified. Similarly, by determining ratios of the level of 
expression of various invention proteins in specific tissues (e.g., patterns of expression) a 
prognosis of health or disease can be made. The levels of invention protein expression in 
various tissues from healthy individuals, as well as, individuals suffering from cancers is 
5 determined. These values can be recorded in a database and can be compared to values 
obtained from tested individuals. Additionally, the ratios or patterns of expression in 
various tissues from both healthy and diseased individuals is recorded in a database. These 
analyses are referred to as "disease state profiles" and by comparing one disease state 
profile (e.g. from a healthy or diseased individual) to a disease state profile from a tested 
10 individual, a clinician can rapidly diagnose the presence or absence of disease. 

The nucleic acid and protein-based diagnostic techniques described above can be 
used to detect the level or amount or ratio of expression of invention genes or proteins in a 
tissue. Through quantitative Northern hybridizations, in situ analysis, 
immunohistochemistry, ELISA, genechip array technology, PGR. and Westem blots, for 
15 example, the amount or level of expression of RNA or protein for a particular invention 
protein (wild-type or mutant) can be rapidly determined and firom this information ratios 
of expression can be ascertained. Alternatively, the invention proteins to be analyzed can 
be family members that are currently unknown but which are identified based on their 
possession of one or more of the homology regions described above. 

20 Without finther description, it is believed that one of ordinary skill in the art can, 

using the preceding description and the following illustrative examples, make and utilize 
the compounds of the present invention and practice the claimed methods. The following 
working examples flierefore, specifically point out preferred embodiments of the present 
invention, and are not to be construed as limiting m any way Ihe remainder of tiie 

2S disclosure. 

EXAMPLES 
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RX AMPLE 1: Identification of Differentially Expressed tnRNA in Cancers ■ 1 

Global changes in gene expression between tumor biopsies and normal tissues 
have been examined using the GeneExpress Oncology Datasuite™ of Gene Logic, Inc. 
(Gaithersburg, MD). The database includes the gene expression profiles, generated by 
5 using the Aflfymetrix Human Genome U95 array, derived from normal and cancer tissue 
samples from many different organs. Among the tissue samples in the database, applicants 
analyzed the expression profiles of normal and cancer tissue sets from breast, colon, 
esophagus, kidney, liver, lung, lymph node, ovary, pancreas, prostate, rectum, and 
stomach. 

10 The Affymetrix Human Genome U95 array contams 63,175 probe sets. A probe 

set is a set of probes to detect one transcript (a gene or a cDNA clone), and usually 
consists of 16-20 oligonucleotide probe pairs. These probe pairs include perfectly matched 
sets and mismatched sets, both of which are necessary for the calculation of average 
difference. Average difference serves as a relative indicator of the level of expression of a 

15 transcript and is a measure of the intensity difference for each probe pair, calculated by 
subtracting the intensity of the mismatch from the intensity of the perfect match. This 
takes into consideration variability in hybridization among probe pairs and other 
hybridization artifacts that could affect the fluorescence intensities. Using the average 
difference value that has been calculated, an absolute call for each gene is made; "Absent" 

20 (= not detected), *Tresenf ' (= detected) or '^Marginal" (= not clearly Absent or Present). 

Differential expression of genes between cancerous and normal tissue samples was 
determined with the following statistical methods. (1) For each probe set, average 
difference values and absolute calls were determined by Affymetrix Microarray Suite 
(v4.0). (2) In a given sample set, outliers among the tissue samples were detected by 
25 Principal Component Analysis (PCA) using MatLab program (The Math Works, Inc., 
Natick, MA). The data points used in the PCA were the average differences of randomly 
selected probe sets (5,000-6,000 probe sets). Outliers were excluded &om further analysis. 
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(3) Variations of gene expression were analyzed by using the Fold Change Analysis tool 
of GeneExpress program. The fold change (cancerous/normal) was calculated by 
comparing the mean average difference for each gene in a cancerous sample set against 
the mean average difference of that gene in the normal tissue sample set. Genes showing 
S at least 3-fold increases or decreases in expression level were obtained. Genes were 
included in the analysis if they had a p-value of less than or equal to 0.05 as determined by 
an Analysis of Variance Test (Steel et al.^ Principles and Procedures of Statistics: A 
Biometrical Approach, Third Ed., McGraw-Hill, 1997). (4) Genes showing differential 
expression in at least 5 different cancer types were selected. 

10 Analysis of the chip data showed that the expression of the marker LFGl was 

significantly up-regulated in cancer tissue samples compared to samples fiom normal 
tissue. The e^q^ression level of LFGl (SEQ ID NO: 1 or 3) can be measured by chip 
sequence fragment no. 91875_s_at on Aflfymetrix GeneChips® U95. The 91875_s_at 
sequence is derived from the EST AI053741. The expression levels of 91875_s _at m 

IS various malignant neoplasms, compared to normal control tissues, are shown in Table 1, 
where the fold-change, the direction of the change (up- or down-regulation), p-value are 
also indicated. The fold change (cancerous/normal) was calculated by comparing the 
geometric mean of average difference in a cancerous sample set against the geometric 
mean of average difference in the normal tissue sample set. A fold change greater than 1 .5 

20 was considered to be significant (Wodicka et ah (1997), Nature Biotech. 15:1359-1367). 
Also indicated in the Table 1 are, for each tissue type, the numbers of samples that are 
called present, absent, or marginal together with the total number of samples in that 
sample set. These data indicate that up-regulation of LFGl may be diagnostic for cancer. 
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The GeneChip expression results, detennined by sample binding to chip sequence 
fragment no. 91875_s _at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Peridn-Elmer). PGR primers (5»-GCTGAAGCAGGAAAATCGCTT-3' 
(SEQ ID NO: 17) and 5'-TGAGACGGAGTCTCACTCGGT-3' (SEQ ID NO: 18)) 

5 designed based on the sequence information file of the specific Afifymetrix fragment 
(91875_s _at) were used in the assay. The target gene in each RNA sample (10 ng of total 
RNA) was assayed relative to a referraice gene. For this purpose, primers (5'- 
GTTTTTCCTAATnTGGCATGAAC-a' (SEQ ID NO: 19) and 5»- 
CGCCCAAGCTTTTCCTTTT-3' (SEQ ID NO: 20)) specific to the CTBPl gene (C- 

10 terminal binding protein 1) were used to serve as control primers. This approach provides 
the relative expression as measured by cycle threshold (Ct) value of the target mRNA 
relative to an amount of CTBPl Ct value. The sample panel included total KNA pairs of 
normal and tumor tissues from colon, kidney, liver, lung, ovary, stomach and pancreas 
(Ambion, Inc., Austin, TX). The Q-RT-PCR data confirms the up-regulation of LFGl in 

1 S cancer compared to normal samples. 

F.YAMPLE 2: Cloning of Full-Leneth Human cDN A (LFGU Correspondmg to 
Differentiallv Expressed mRNA Species 



The full-length cDNA having SEQ ID NO: 1 or 3 was obtamed by polymerase chain 
reaction (PCR) and rapid amplification of cDNA ends (RACE) using cDNA library from 



human heart (ResGen, Huntsville, AL). 


Gene-specific 


oligos for 


PCR 


(5'- 


CACCCTTTGCCTCTGTCACTTCCGCA-3' 


(SEQ 


ID 


NO: 


21), 


5'- 


GCTGGAGCACCAGGACTGCATTG-3' 


(SEQ 


ID 


NO: 


22), 


5'- 


GGAGCTGAGCAGCAGTGTAATGAA-3' 


(SEQ 


ID 


NO: 


23), 


5'- 


GAGGCCTGCCTGAAGGAGGAGCTTC-3' 


(SEQ 


ID 


NO: 


24), 


5'- 


TCTGGAAGTAGTGCAGACGCCTCAGG-3' 


(SEQ 


ID 


NO: 


25), 


5'- 


AGCCAACGTCGGCTTTGTTATCCAGC-3' 


(SEQ 


ID 


NO: 


26). 


5'- 
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GCTGTCAGATATGATGGTTCTGGAC-3' (SEQ ID NO: 27). 5'- 
CCAGCCTCACCACTGTTGGGTTGC-3* (SEQ ID NO: 28), 5'- 
CATTCTCTGAGCTGTATTAGTGT-3' (SEQ ID NO: 29). 5'- 
CCTGAGCTGGAATGACCTGCA-3' (SEQ ID NO: 30), 5'- 
CTTTGTGTTGGCTGCAGCCACA-3' (SEQ ID NO: 31), 5'- 
TGAGGAGAGACTTTGCTGACTGGT-3' (SEQ ID NO: 32). 5'- 
GTCCTGTCTGGCGGTGCCGA-3' (SEQ ID NO: 33), 5'- 

GCTCCAGGATCCCCTGTCACCTGGGCCTTCTGCCTTTTGGCT-3' (SEQ ID NO: 34), 
5'-CCATATGGAGAGGAGAGCAGCGGGCCCA-3' (SEQ ID NO: 35), 5'- 
GAAGGAGGAACATGGAGAGGAGA-3' (SEQ ID NO: 36), 5'- 
CCATATGCCCCGGGTAGTCTACTGCAT-3' (SEQ ID NO: 37), and 5'- 
GTCGACTCGAGTCACTTCCGCAAAAACTTCTTG-3' (SEQ ID NO: 38)) and RACE 
(5'-TCCATTCCGAAGGCTCTCCTCC-3' (SEQ ED NO: 39), 5'- 
GTCTGTGTGACGGAAATGTAAGC-3' (SEQ ID NO: 40), and 5'- 
GAAGGTCGAAGGCAGACCGATGT-3' (SEQ ID NO: 41)) were designed based on 
predicted genes containing the 91875_s_at sequence using Human Genome Browser 
(University of California, Santa Cruz). The amplified products with the primers were 
incorporated into PCR4-Topo vector using Topo Cloning System (Invitrogen, Carlsbad, 
CA), and followed by sequencing. 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NOS: 1 and 3. In the 
former, the cDNA comprises 5293 base pairs. In the latter, the cDNA comprises 5317 base 
pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 1, at 
nucleotides 390-4880 (390-4883 including the stop codon), encodes a protein of 1497 
amino acids. The amino acid sequence corresponding to a predicted protein encoded by 
SEQ ID NO: 1 is set forth in SEQ ID NO: 2. Figure 2 shows the results of a hydrophobicity 
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analysis of tfie amino acid sequence of SEQ ID NO: 2 using Kyte-Dooiitde values (Kyte 
and Doolittie (1982), J. Mol Biol 157:105-142). Hydrophilic regions may be used to 
produce antigenic peptides, as described above. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 3, at 
5 nucleotides 12-4904 (12-4907 including the stop codon), encodes a protein of 1631 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 3 is set forth in SEQ ID NO: 4. Figure 3 shows the results of a hydrophobicity analysis 
of the amino acid sequence of SEQ ID NO: 4 using Kyte-Doolittle values (Kyte and 
Doolittie (1982), J. Mol Biol 157:105-142). Hydrophilic regions may be used to produce 
10 antigenic peptides, as described above. 

The protein sequence of SEQ ID NO: 2 is identical to that of SEQ ID NO: 4, except 
that SEQ ID NO: 2 lacks the first 134 amino acids at the N-terminus of SEQ ID NO: 4. 

SEQ ID NOS: 2 and 4 contain Calponin homology domain (amino acid positions 38- 
145 of SEQ ID NO: 4), IQ domain for calmodulin-binding (amino acid positions 629-646 of 

15 SEQ ID NO: 2 and amino acid positions 763-780 of SEQ ID NO: 4), RasGAP domam 
(amino acid positions 858-1195 of SEQ ID NO: 2 and amino acid positions 992-1329 of 
SEQ ID NO: 4), and RasGAP C-terminal domam (amino acid positions 1298-1421 of SEQ 
ID NO: 2 and amino acid positions 1432-1555 of SEQ ID NO: 4). SEQ ID NOS: 2 and 4 
are shnilar to IQGAP proteins (Weissbach et al (1994), J Biol Chem 269:20517-20521; 

20 Brill et al. (1996), Mol Cell Biol 16:4869-4878). IQGAP binds to and modulate the 
function of proteins involved in cytoskeletal structure, cell-cell adhesion, and proliferation 
signaling (Fukada et al. (2002), Cell 109: 1-20; Briggs et al (2002), J Biol Chem 277: 7453- 
7465; McCallum et al (1998), J Biol Chem 273 : 22537-22544). IQGAP 1 -deficient mice 
exhibited a significant increase in late-onset gastric hyperplasia relative to wild-type (Li et 

25 al (2000), Mol Cell Biol 20: 697-701). 



53 



wo 2004/035789 




PCT/KR2003/002161 



Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFGl. A Northern blot containing total KNAs from various 
human tissues was used (Human 12-Lane MTN Blot, Clontech, Palo Alto, CA), and an EST 
containing 91875_s _at sequence was radioactively labeled by the random primer method 
5 and used to probe the blot The blot was hybridized in 50% formamide, 5X SSPE, 0.1% 
SDS, 5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42*^0 and washed with 
0.2X SSC containing 0.1% SDS at room temperature. The Northern blot showed three 
transcripts for this gene, which are approximately 7.2 kb, and 6.3 kb in size. This 
corresponds to the sizes of the LFGl clones (SEQ ID NO: 1 and 3). 

10 EXAMPLE 3: Identification of Differentially Expressed mRNA in Cancers - 2 

The process in EXAMPLE 1 was repeated except that the marker LFG2 was used 
instead of the marker LFGl. 

Analysis of the chip data showed that the expression of the marker LFG2 was 
significantly down-regulated in cancer tissue samples compared to samples from normal 

15 tissue. The expression level of LFG2 (SEQ ID NO: 5) can be measured by chip sequence 
fragment no. 8294 l_at on Afifymetrix GeneChips® U95, The 8294 l_at sequence is derived 
from the EST AJ277612. The expression levels of 8294 l_at in various malignant neoplasms, 
compared to normal control tissues, are shown m Table 2, where the fold-change, the 
direction of the change (up- or down-regulation), p-value are also indicated. The fold 

20 change (cancerous/normal) was calculated by comparing the geometric mean of average 
difference in a cancerous sample set against the geometric mean of average difference in the 
normal tissue sample set. A fold-change greater than l.S was considered to be significant 
(Wodicka et al. (1997), Nature Biotech. 15:1359-1367). Also indicated in the Table 2 are, 
for each tissue type, the numbers of samples that are called present, absent, or marginal 

25 together with the total number of samples in that sample set. These data indicate that down- 
regulation of LFG2 may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fiagment no. 82941_at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Ehner). PGR primers (5'- 

GAATGTGtCAGAGACAAGTGCAGC-S' (SEQ ID NO: 42) and 5'- 

5 TGTAGAAACTCTTGGACTAATGGAGG-3' (SEQ ID NO: 43)) designed based on the 
sequence information file of the EST containing the Affymetrix fragment (82941_at) were 
used in the assay. The target gene m each RNA sample (10 ng of total RNA) was assayed 
relative to a reference gene. For this purpose, primers (5'- 

GTTTTTCCTAATTTTGGCATGAAC-3' (SEQ ID NO: 19) and 5'- 

10 CGCCCAAGCTTTTCCTTTT-3' (SEQ ID NO: 20)) specific to the CTBPl gene (C- 
terminal bindmg protein 1) were used to serve as control primers. This approach provides 
the relative expression as measured by cycle threshold (Ct) value of the target mKNA 
relative to an amovmt of CTBPl Ct value. The sample panel included total KNA pairs of 
normal and tumor tissues from colon, liver, Ivuig, ovary, and stomach (Ambion, Inc., Austin, 

15 TX). The Q-RT-PCR data confirms the down-regulation of LFG2 in cancer compared to 
normal samples. 

EXAMPLE 4: Cloning of Full-Lenizth Human cDNA (LFG2 ^ Corresponding to 
DifFerentiallv Expressed mRNA Species 

The fiill-length cDNA having SEQ ID NO: 5 was obtained by the oligo-pulling 
20 method using the GeneTrapper assay (Life Technologies, Rockville, MD). Briefly, a gene- 
specific oligo (5'-GAATGTGTCAGAGACAAGTGCAGC-3' (SEQ ID NO: 42)) was 
designed based on the sequence of the EST containing 8294 l_at sequence. The oligo was 
labeled with biotin and used to hybridize with 5 jig of single strand plasmid DNA (cDNA 
recombinants) &om a poorly differentiated stomach adenocarcinoma library (NCI CGAP 
25 Gas4) (ResGen, Huntsville, AL) following the procedures of Sambrook et al The 
hybridized cDNAs were separated by streptavidin-conjugated beads and eluted by heating. 
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The eluted cDNA was converted to double strand plasmid DNA and used to transform E. 
coli cells (DHIOB) and the longest cDNA was screened. After positive selection was 
confirmed by PGR using gene-specific primers, the cDNA clone was subjected to DNA 
sequencing. 

5 The nucleotide sequence of the full-length human cDNAs corresponding to the 

diffijrentiaUy regulated mRNA detected above is set forth in SEQ ID NO: 5. The cDNA 
comprises 3608 base p£urs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 5, at 
nucleotides 424-1908 (424-1911 includmg the stop codon), encodes a protein of 495 amino 
10 acids. The amino acid sequence correspondmg to a predicted protein encoded by SEQ ID 
NO: 5 is set forth in SEQ ID NO: 6. 

SEQ ID NO: 6 has homology to scavenger receptors, which are involved in 
endocytosis of selected polyanionic ligands, phagocytosis of apoptotic cells and bacteria, 
cell adhesion, and development of atherosclerosis (Peiser et ah (2002), Curr. Opin. 
15 Immunol. 14:123-128; Resnick et al (1994), Drencb Biol. Sci. 19:5-8). Based on published 
studies of scavenger receptors. SEQ ID NO: 6 contwns a cytoplasmic domain (amino acid 
positions 1-35), a transmenbrane domain (amino acid positions 36-58), an a-helical coiled- 
coil domain (amino acid positions 90-301), a collagen-like domam (amino acid positions 
305-380), and a scavenger receptor cystein-rich (SRCR) domam (amino acid positions 393- 
20 493). The SRCR domain contains six cysteme residues (amino acid positions 41 8, 43 1 , 462, 
472, 482, and 492), which may participate in intradomain disulfide bonds. SEQ ID NO: 6 
also exhibits homology to a mouse homologue (GenBank Accession No. BC016096). It 
shows 70% identity over the entire contiguous sequence. 

Figure 4 shows the results of a hydrophobicity analysis of the amino acid sequence 
25 of SEQ ID NO: 6 using Kyte-Doolittle values (Kyte and DooUttle (1982), J. Mol Biol. 
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157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
above. 

Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG2. A Northern blot containing total RNAs from various 

5 human tissues was used (Human MTN Blot, Clontech, Palo Alto, CA), and the EST 
containing 8294 l_at sequence was radioactively labeled by the random primer method and 
used to probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0.1% SDS, 
5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42''C and washed with 0.2X 
SSC containing 0.1% SDS at room temperature. The Northem blot showed a single 

10 transcript for this gene, which is approximately 3.7 kb in size. This corresponds to the size 
of the LFG2 clone (SEQ ID NO: 5). 

EXAMPLE 5: Identification of Differentiallv Expressed mRNA in Cancers - 3 

The process in EXAMPLE 1 was repeated except that the marker LFG3 was used 
instead of the marker LFGl. 

15 Analysis of the chip data showed that the expression of the marker LFG3 was 

significantly down-regulated in cancer tissue samples compared to samples from normal 
tissue. The expression level of LFG3 (SEQ ID NO: 7) can be measured by chip sequence 
fragment no. 46104_at on Afifymetrix GeneChips® U95. The 46104_at sequence is derived 
from the EST AA772055. The expression levels of 46104_at in various malignant 

20 neoplasms, compared to normal control tissues, are shown in Table 3, where the fold- 
change, the direction of the change (up- or down-regulation), p-value are also indicated. The 
fold change (cancerous/normal) was calculated by comparing the geometric mean of 
average difference in a cancerous sample set against the geometric mean of average 
difference in the normal tissue sample set. A fold-change greater than 1.5 was considered to 

25 be significant (Wodicka et al. (1997), Nature Biotech. 15:1359-1367). Also indicated in the 
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Table 3 are, for each tissue type, the numbers of samples that are called present, absent, or 
marginal toge&er with the total number of samples in that sample set These data indicate 
that down-regulation of LFG3 may be diagnostic for cancer. 



59 



wo 2004/035789 



PCT/iCR2003/002161 




60 



wo 2004/035789 



PCT/KR2003/002161 



The GeneChip expression results, determined by sample binding to chip sequence 
fragment no- 46104_at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PGR primers (5'- 

GTATGGATCAGAATTCCCTATAGATCTTT-3' (SEQ ID NO: 44) and 5'- 

5 TAGATGTTTGGGCAACAGCCT-3* (SEQ ID NO: 45)) designed based on the sequence 
mformation file of the EST containing the A£3Eymetrix fragment (46104_at) were used in the 
assay. The target gene in each RNA sample (10 ng of total RNA) was assayed relative to a 
reference gene. For this purpose, prhners (5'-GTTTTTCCTAATTTTGGCATGAAC-3» 
(SEQ ID NO: 19) and 5*-CGCCCAAGCTTTTCCTTTT-3' (SEQ ID NO: 20)) specific to 

10 the CTBPl gene (C-terminal binding protein 1) were used to serve as control primers. This 
approach provides the relative expression as measured by cycle threshold (Ct) value of the 
target mRNA relative to an amount of CTBPl Ct value. The sample panel included total 
RNA pairs of normal and tumor tissues from colon, kidney, ovary, pancreas, and stomach 
(Ambion, Inc., Austin, TX). The Q-RT-PCR data confirms the down-regulation of LFG3 in 

1 S cancer compared to normal samples. 

EXAMPLE 6: Cloning of Full-Length Human cDNA fLFG3 ^ Corresponding to 
DiflFerentiallv Expressed mRNA Species 

The full-length cDNA having SEQ ID NO: 7 was obtained by the oligo-pulling 
method using the GeneTrapper assay (Life Technologies, Rockville, MD). Briefly, a gene- 

20 specific oligo (5'-GTATGCATCAGAATTCCCTATAGATCTTT-3' (SEQ ID NO: 44)) 
was designed based on the sequence of the EST containmg 46104_at sequence. The oligo 
was labeled with biotin and used to hybridize with 5 |ig of single strand plasmid DNA 
(cDNA recombinants) from human fetal kidney (ResGen, Huntsville, AL) following the 
procedures of Sambrook et al The hybridized cDNAs were separated by streptavidin- 

25 conjugated beads and eiuted by heating. The eluted cDNA was converted to double strand 
plasmid DNA and used to transform £. coli cells (DHIOB) and the longest cDNA was 
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screened. After positive selection was confirmed by PCR using gene-specific primers, the 
cDNA clone was subjected to DNA sequencing. The 5'-end of LFG3 was identified by 
rapid amplification of cDNA ends (RACE) using the cDNA prepared firom human fetal 
kidney (Clontech, Palo Alto, CA) and a gpne specific primer (5'- 
TTCCTTCACCAAAGGCATCCAGCCATTCTATG-3' (SEQ IDNO: 46)). 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NO: 7. The cDNA 
comprises 3162 base p£urs. 

An open reading firame within the cDNA nucleotide sequence of SEQ ID NO: 7, at 
nucleotides 405-1835 (405-1838 including the stop codon), encodes a protein of 477 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 7 is set forth in SEQ ID NO: 8. 

SEQ ID NO: 8 is similar to monocarboxylate transporters (MCTs) and contains ten 
predicted transmembrane domains (amino acids positions 10-29, 80-99. 107-128, 140-160, 
274-295, 312-332, 339-360, 363-384, 396-416, and 433-451). MCT proteins catalyze the 
fecilitated transport of monocarboxylates such as lactate, pyruvate, branched-chain oxo 
acids, ketone bodies, beta-hydroxy-butylate, and acetate (Halestrap and Price (1999), 
Biochem. J. 343:281-299). Table 4 sunrunarizes the similarity ratios of SEQ ID NO: 4 with 
the eight known monocarboxylate transporters. 



TABLE 4 . Homology of LFG3 with MCT proteins 



Protein 


Size (amino acids) 


Identity (%} 


Positives (%) 


IVICT1 


500 


17.5 


34.3 


I^CT2 


478 


19.5 


35.5 


l\/iCT3 


504 


19.5 


34.1 


MCT4 


465 


19.0 


33.2 



62 



wo 2004/035789 




PCT/KR2003/002161 



MCT5 


487 


22.1 


36.9 


MCT6 


505 


16.4 


31.5 


MCT7 


523 


20.1 


35.2 


MCT8 


613 


15.9 


27.9 



Figure 5 shows the results of a hydrophobicity analysis of the amino acid sequence 
of SEQ ID NO: 8 using Kyte-Doolittle values (Kyte and Doolittle (1982), X Mol Biol 
157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
5 above. 

Analysis by Nordiem blot was performed to determine the size of the mRNA transcripts that 
correspond to LFG3, A Northern blot containing total RNAs from various human tissues 
was used (Human 12-Lane MTN Blot, Clontech, Palo Alto, CA), and the EST containing 
46104_at sequence was radioactively labeled by the random primer method and used to 
10 probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0.1% SDS, 5X 
Denhart's solution, and 0.2 mg/ml herring sperm DNA at A2^C and washed with 0.2X SSC 
containing 0, 1% SDS at room temperature. The Northern blot showed a single transcript for 
this gene, which is approximately 4.2 kb in size. This corresponds to the size of the LFG3 
clone (SEQ ID NO: 7). 

15 EXAMPLE 7: Identification of Differentially Expressed mRNA in Cancers - 4 

The process in EXAMPLE 1 was repeated except that the marker LFG4 was used 
instead of the marker LFGl. 

Analysis of the chip data showed that the expression of the marker LFG4 was 
significantly down-regulated in cancer tissue samples compared to samples from normal 
20 tissue. The expression level of LFG4 (SEQ ID NO: 9) can be measured by chip sequence 
fragment no. 62158_at on Affymetrix GeneChips® U95. The 622158_at sequence is 
derived from the EST An23S32. The expression levels of 62158_at in various malignant 
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neoplasms, compared to nonnal control tissues, are shown in Table S, where the fold- 
change, the direction of tiie change (up- or down-regulation), p-value are also indicated. The 
fold change (cancerous/normal) was calculated by comparing the geometric mean of 
average difference in a cancerous sample set against the geometric mean of average 
difference in the normal tissue sample set. A fold-change greater than l.S was considered to 
be significant (Wodicka et al. (1997), Nature Biotech. 15:1359-1367). Also mdicated in the 
Table S are, for each tissue type, the numbers of samples that are called present, absent, or 
marginal together with the total number of samples in that sample set. These data indicate 
that down-regulation of LFG4 may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fragment no. 62158_at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PGR primers (5'- 

AAATGTCTGATTACCCCATTTTATCAGT-3' (SEQ ID NO: 47) and 5». 

5 TAATCCTGAAATGAACAGCTAACA-30 (SEQ ID NO: 48) designed based on the 
sequence information file of the EST containing the Affymetrix fragment (62158_at) were 
used in the assay. The target gene in each RNA sample (10 ng of total RNA) was assayed 
relative to a reference gene. For this purpose, primers (5*- 

GTTTTTCCTAATTTTGGCATGAAC-3' (SEQ ID NO: 19) and 5*- 

10 CGCCCAAGCTTTTCCTTTT-3* (SEQ ID NO: 20)) specific to the CTBPl gene (C- 
terminal binding protein 1) were used to serve as control primers. This approach provides 
the relative expression as measured by cycle threshold (Ct) value of the target mRNA 
relative to an amount of CTBPl Ct value. The sample panel included total RNA pairs of 
normal and tumor tissues from colon, liver, lung, ovary, pancreas, and stomach (Ambion, 

15 Inc., Austin, TX). The Q-RT-PCR data confirms the down-regulation of LFG4 in cancer 
compared to normal samples. 

EXAMPLE 8: Cloning of Full-Length Human cDNA rLFG4 ^ Corresponding to 
DiflFerentiallv Expressed mRNA Species 

The fiiU-length cDNA having SEQ ID NO: 9 was obtained by rapid amplification of 
20 cDNA ends (RACE). Briefly, gene-specific oligos (5'- 
TAATGTTAGAGTAACAGCATTTTCCTTCAA-3' (SEQ ID NO: 49) and 5'- 
TGCCCCACACTAACTCAGTTCTTGTGATG-3' (SEQ ID NO: 50)) were designed based 
on the sequence of the EST contaming 62158_at sequence. The oligos was used for PCR 
amplification of the cDNAs prepared from himian brain (Clontech, Palo Alto, CA). The 
25 amplified products with the primers were incorporated into PCR4-Topo vector using Topo 
Cloning System (Invitrogen, Carlsbad, CA), and followed by sequencing. 
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The nucleotide sequence of the full-lraigth human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID KO: 9. The cDNA 
comprises 4891 base pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 9, at 
5 nucleotides 89-1150 (89-1153 including the stop codon), encodes a protein of 354 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 9 is set forth in SEQ ID NO: 10. 

SEQ ID NO: 10 is similar to rat Kilon and chicken Neurotractm (Funatsu et al. 

{1999), J Biol Chem 274:8224-8230; Marg et al. (1999), J Cell Biol 145:865-876). Protein 
10 sequence analysis reveals a secretory signal peptide (anuno acid positions 1-33), three 

immunoglobulin domains (amino acid positions 47-136, 145-208, and 231-312), and six 

putative //-linked glycosylation sites (amino acid positions 73, 155, 275, 286, 294, and 307). 

Kilon/Neurotractin is a member of IgLON subfamily of the immunoglobulin superfamily. 

IgLONs are a family of glycosylphosphatidylinositol (GPr)-linked cell adhesion molecules 
15 which are thought to modify neurite outgrowth and might play a role in cell-cell adhesion 

and recognition (Miyate et al. (2000), J Comparative Neurol 424:74-85). 

Figure 6 shows the results of a hydrophobicity analysis of the ammo acid sequence 
of SEQ ID NO: 10 using Kyte-Doolittle values (Kyte and Doolittle (1982), J. Mol. Biol. 
157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
20 above. This hydropathy plot shows the presence of hydrophobic region at the C-termmus. In 
case of GPI-anchored proteins, the addition of the GPI anchor is known to occur after the 
cleavage of the C-terminal hydrophobic region. A putative GPI anchor attachment site was 
found (Gly at the amino acid position 324). 

Analysis by Norfliem blot was performed to determine the size of the mRNA 
25 transcripts that correspond to LFG4. A Northern blot containing total RNAs from various 
human tissues was used (Human 12-Lane MTN Blot, Clontech, Palo Alto, CA), and fee 
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EST containing 621S8_at sequence was radioactively labeled by the random primer method 
and used to probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0.1% 
SDS, 5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42^*0 and washed with 
0.2X SSC containmg 0.1% SDS at room temperature. The Northern blot showed a single 
5 transcript for this gene, which is approximately 5.4 kb in size. This corresponds to the size 
of the LFG4 clone (SEQ ID NO: 9). 

EXAMPLE 9: Identification of Differentially Expressed mRNA in Cancers - 5 

The process in EXAMPLE 1 was repeated except that the marker LFG5 was used 
instead of the marker LFGl . 

10 Analysis of the chip data showed that the expression of the marker LFG5 was 

significantly down-regulated in cancer tissue samples compared to samples fi*om normal 
tissue. The expression level of LFG5 (SEQ ID NO: 11) can be measured by chip sequence 
firagment no. 46659_at on Affymetrix GeneChips® U95. The expression levels of 46659_at 
in various malignant neoplasms, compared to normal control tissues, are shown in Table 6, 

15 where the fold-change, the direction of the change (up- or down-regulation), p-value are 
also indicated. The fold change (cancerous/normal) was calculated by comparing the 
geometric mean of average difference in a cancerous sample set against the geometric mean 
of average difference in the normal tissue sample set. Also indicated in the Table 6 are, for 
each tissue type, the numbers of samples that are called present, absent, or marginal together 

20 with the total number of samples in that sample set. These data mdicate that differential 
regulation of LFG5 may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fragment no, 46659_at, were validated by quantitative RT-PCR (Q-RT-PCR) using &e 
Taqman® assay (Perkin-Elmer). PGR primers (5'- 

AAGGGTTTATGAGGTCTGGATATAGAATG-3' (SEQ ID NO: 51) and 5'- 

5 GCAAAGAACCCTAATGCTATTTATCAGC-3' (SEQ ID NO: 52)) designed based on the 
sequence information file of the specific Afifymetrix fragment (46659__at) were used in the 
assay. The target gene in each RNA sample (10 ng of total KNA) was assayed relative to a 
reference gene. For this purpose, primers (5'-GTTTTTCCTAATTTTGGCATGAAC-3' 
(SEQ ID NO: 19) and 5'-CGCCCAAGCTTTTCCTTTT-3' (SEQ ID NO: 20)) specific to 

10 the CTBPl gene (C-terminal binding protem 1) were used to serve as control primers. This 
approach provides the relative expression as measured by cycle threshold (Ct) value of the 
target mRNA relative to an amount of CTBPl Ct value. The sample panel included total 
KNA pairs of normal and tumor tissues from kidney, lung, ovary, and pancreas (Ambion, 
Inc., Austin, TX). The Q-RT-PCR data confirms the diflferential regulation of LFG5 in 

15 cancer compared to normal samples. 

EXAMPLE 10: Cloning of Full-Length Human cDNA fLFG5^ Corresponding to 
Dififerentiallv Expressed mRNA Species 

The full-length cDNA having SEQ ID NO: 11 was obtained by the oligo-puUing 
method using the GeneTrapper assay (Life Technologies, Rockville, MD). Briefly, a gene- 

20 specific oligo (5*-GAGAAGACCAGGGAAGAAGCAG-3' (SEQ ID NO: 53)) was 
designed based on the sequence of an EST containing 46659_at sequence. The oligo was 
labeled with biotin and used to hybridize with 5 ^ig of single strand plasmid DNA (cDNA 
recombinants) from a human heart library (ResGen, Huntsville, AL) following the 
procedures of Sambrook et al The hybridized cDNAs were separated by streptavidin- 

25 conjugated beads and eluted by heating. The eluted cDNA was converted to double strand 
plasmid DNA and used to transform E. colt cells (DHIOB) and the longest cDNA was 
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screened. After positive selection was confirmed by PGR using gene-specific primers, the 
cDNA clone was subjected to DNA sequencing. 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NO: 11. The cDNA 
5 comprises 3098 base pairs. 

An open reading frame within die cDNA nucleotide sequence of SEQ ID NO: 1 1, at 
nucleotides 223-1569 (223-1572 including the stop codon), encodes a protein of 449 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 1 1 is set forth m SEQ ID NO: 12. 

10 SEQ ID NO: 12 contains a thymidylate kinase domain (amino acid positions 257- 

438). Thymidylate kinase is a member of nucleotide monophosphate kinases (NMPKs) 
which play roles in the nucleotide synthesis for RNA and DNA synthesis and are required 
for the pharmacological activation of therapeutic nuicleoside and nucleotide analogs (Van 
Rompay etal (2000), Pharmacology & Therapeutics 87:189-198). SEQ ID NO: 12 exhibits 

15 homology to a mouse thymidylate kinase (GenBank Accession No. NM_020557) which is 
induced during macrophage activation (Lee and O'Brien (1995), J ImmtmoL 154:6094- 
6102). It shows 63% identity over the entire contiguous sequence. 

Figure 7 shows the results of a hydrophobicity analysis of the amino acid sequence 
of SEQ ID NO: 12 usmg Kyte-Doolittle values (Kyte and Doolittle (1982), J. Mol BioL 
20 157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
above. 

Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG5. A Northern blot containing total RNAs from various 
human tissues was used (Human MTN Blot, Clontech, Palo Alto, CA), and an EST 
25 containing 8294 l_at sequence was radioactively labeled by the random primer method and 
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used to probe the blot The blot was hybridized in 50% foimamide, 5X SSPE, 0.1% SDS, 
5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 4T>C and washed with 0.2X 
SSC containing 0.1% SDS at room temperature. The Northern blot showed a single 
transcript for this gene, which is approximately 3.0 kb in size. This corresponds to the size 
5 of the LFG5 clone (SEQ ID NO: 1 1). 

RyAMPT.E 1 1: Identification of Differentiall y Rxpressed mRNA in Cancers - 6 

The process in EXAMPLE 1 was repeated except that the marker LFG6 was used 
instead of the marker LFGl . 

Analysis of the chip data showed that the expression of the marker LFG6 was 
10 significantly up-regulated in cancer tissue samples compared to samples from normal tissue. 
The expression level of LFG6 (SEQ ID NO: 13 or 15) can be measured by chip sequence 
fragment no. 44103_at on Afifymetrix GeneChips® U95. The 44103_at sequence is derived 
from the EST AA865614. The expression levels of 44103_at in various malignant 
neoplasms, compared to normal control tissues, are shown in Table 7, where the fold- 
15 change, the direction of the change (up- or down-regulation), p-value are also indicated. The 
fold change (cancerous/normal) was calculated by comparing the geometric mean of 
average difference m a cancerous sample set against the geometric mean of average 
difference in the normal tissue sample set. A fold change greater than 1 .5 was considered to 
be significant (Wodicka et al. (1997), Nature Biotech 15:1359-1367). Also mdicated in the 
20 Table 7 are, for each tissue type, the numbers of samples that are called present, absent, or 
marginal together with the total number of samples in that sample set. These data indicate 
that up-regulation of LFG6 may be diagnostic for cancer. 
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The GeneChip expression icsults, detennined by sample binding to chip sequence 
fragment no. 44103_at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PGR primers (5'-GGACGGGGAACTTGGACGC-3' 
(SEQ ID NO: 54) and 5'-AAGTGCAGGGCCTCTGGGTG-3' (SEQ ID NO: 55)) designed 

5 based on tiie sequence information file of the specific Afifymetrix fragment (44103_at) were 
used in the assay. Hie target gene in each RNA sample (10 ng of total RNA) was assayed 
relative to a reference gene. For this purpose, primers (5'- 

GTTTTTCCTAATTTTGGCATGAAC-3' (SEQ ID NO: 19) and 5'- 
CGCCCAAGCTTTTCCTTTT-3' (SEQ ID NO: 20)) specific to the CTBPl gene (C- 

10 terminal binding protein 1) were used to serve as control primers. This approach provides 
the relative expression as measured by cycle threshold (Ct) value of the target mRNA 
relative to an amount of CTBPI Ct value. The sample panel included total RNA pairs of 
normal and tumor tissues from liver and ovary (Ambion, Inc., Austin, TX). The Q-RT-PCR 
data confirms the up-regulation of LFG6 in cancer compared to normal samples. 

15 EXAMPLE 12: Cloning of Full-Leneth Human cDNA fLFGg> Corresponding to 
Differentially Expressed mRNA Species 

The full-length cDNA having SEQ ID NO: 13 or 15 was obtained by the oligo- 
pulling method using the GeneTrapper assay (Life Technologies, Rockville, MD). Briefly, 
a gene-specific oligo (5'-CGCTGGGTCATCGGACGGT-3' (SEQ ID NO: 56)) was 

20 designed based on the sequence of an EST containing 44103_at sequence. The oligo was 
labeled with biotin and used to hybridize with 5 ^ig of single strand plasmid DNA (cDNA 
recombinants) from a fully differentiated human stomach adenocarcinoma library (ResGen, 
Huntsville, AL) following tiie procedures of Sambrook et al The hybridized cDNAs were 
separated by streptavidin-conjugated beads and eluted by heating. The eluted cDNA was 

25 converted to double strand plasmid DNA and used to transform E. coli cells (DHIOB) and 
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the longest cDNA was screened. After positive selection was confirmed by PGR using 
gene-specific primers, the cDNA clone was subjected to DNA sequencing. 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NOS: 13 and 15, In the 
5 former, the cDNA comprises 1893 base pairs. In the latter, the cDNA comprises 1597 base 
pairs. 

An open reading firame within the cDNA nucleotide sequence of SEQ ID NO: 13, at 
nucleotides 418-1392 (418-1395 including the stop codon), encodes a protein of 325 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
10 NO: 13 is set forth in SEQ ID NO: 14. Figure 9 shows the results of a hydrophobicity 
analysis of the amino acid sequence of SEQ ID NO: 14 using Kyte-Doolittle values O^^yte 
and Doolittle (1982), J, Mol Biol 157:105-142). Hydrophilic regions may be used to 
produce antigenic peptides, as described above. 

An open reading firame within the cDNA nucleotide sequence of SEQ ID NO: 15, at 
15 nucleotides 271-1431 (271-1434 mcluding the stop codon), encodes a protein of 387 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 15 is set forth in SEQ ID NO: 16. Figure 10 shows the results of a hydrophobicity 
analysis of the amino acid sequence of SEQ ID NO: 16 using Kyte-Doolittle values (Kyte 
and Doolittle (1982), J. Mol Biol 157:105-142). Hydrophilic regions may be used to 
20 produce antigenic peptides, as described above. 

SEQ ID NOS: 14 and 16 contain ubiquitin homologues (UBQ) domain (amino acid 
positions 239-300). SEQ ID NOS: 14 and 16 are similar to rat Shaipin protein (Lim et al 
(2001), Mol Cell Neurosci 17:385-397). Shaipin dkectly interacts with the ankyrin repeats 
of Shank protein which functions in the organization of cytoskeletal complexes and 
25 mtracellular signaling at specialized cell junctions (Sheng and Kim (2000), J Cell Sci 
113:1851-1856). 
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Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG6. A Northan blot containing total RNAs firom various 
human tissues was used (Human 12-Lane MTN Blot, Clontech, Palo Alto, CA), and an EST 
containing 44103_at sequence was radioactively labeled by the random primer method and 
5 used to probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0. 1% SDS, 
5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42*0 and washed with 0.2X 
SSC contaming 0. 1% SDS at room temperature. The Northern blot showed three transcripts 
for this gene, which are approximately 2.2 kb, 1.5 kb, and 1.2 kb m size. This corre^nds 
to the sizes of the LFG6 clones (SEQ ID NO: 13 and 15), 

10 Althougji the present invention has been described m detail with reference to 

examples above, it is understood that various modifications can be made without departing 
fiom the spirit of the invention. Accordingly, the invention is limited only by the following 
claims. All cited patents, patent applications and publications referred to in this application 
are herein incorporated by reference in their entirety. 

15 
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WHAT TS CI^AIMED IS; 

1. An isolated nucleic acid molecule selected from the group consisting of: (a) 
an isolated nucleic acid molecule comprising SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15, (b) an 
isolated nucleic acid molecule encoding SEQ ID NO: 2, 4, 6, 8, 10, 12. 14 or 16, (c) an 

5 isolated nucleic acid molecule that encodes a protein that is expressed in cancer and that 
exhibits at least about 75% nucleotide sequence identity over the entire contiguous sequence 
of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, and (d) an isolated nucleic acid molecule 
comprising the complement of a nucleic acid molecule of (a), (b) or (c). 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid 
10 molecule comprises nucleotides 390-4880 of SEQ ID NO: 1, nucleotides 12-4904 of SEQ 

ID NO: 3, nucleotides 424-1908 of SEQ ID NO: 5, nucleotides 405-1835 of SEQ ID NO: 7, 
nucleotides 89-1150 of SEQ ID NO: 9, nucleotides 223-1569 of SEQ ID NO: 11, 
nucleotides 418-1392 of SEQ ID NO: 13, or nucleotides 271-1431 of SEQ ID NO: 15. 

3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid 
15 molecule comprises nucleotides 390-4883 of SEQ ID NO: 1, nucleotides 12-4907 of SEQ 

ID NO: 3, nucleotides 424-1911 of SEQ ID NO: 5, nucleotides 405-1838 of SEQ ID NO: 7, 
nucleotides 89-1153 of SEQ ID NO: 9, nucleotides 223-1572 of SEQ ID NO: 11, 
nucleotides 418-1395 of SEQ ID NO: 13, or nucleotides 271-1434 of SEQ ID NO: 15. 

4. The isolated nucleic acid molecule of claim 1, wherem the nucleic acid 
20 molecule consists of nucleotides 390-4883 of SEQ ID NO: 1, nucleotides 12-4907 of SEQ 

ID NO: 3, nucleotides 424-1908 of SEQ ID NO: 5, nucleotides 405-1835 of SEQ ID NO: 7, 
nucleotides 89-1153 of SEQ ID NO: 9, nucleotides 223-1569 of SEQ ID NO: 11, 
nucleotides 418-1395 of SEQ ID NO: 13, or nucleotides 271-1434 of SEQ ID NO: 15. 

5. The isolated nucleic acid molecule of any one of claims 1-4, wherein said 
25 nucleic acid molecule is operably linked to one or more ejcpression control elements. 
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6. 



A vector comprising an isolated nucleic acid molecule of any one of claims 



1^. 



7. A host cell transformed to contain the nucleic acid molecule of any one of 
claims 1-4. 

8. A host cell comprising a vector of claim 6. 

9. A host cell of claim 8, wherein said host cell is selected from the group 
consisting of prokaryotic host cells and eukaryotic host cells. 

10. A method for producing a polypeptide comprising culturing a host cell 
transformed with the nucleic acid molecule of any one of claims 1-4 under conditions in 
which the protein encoded by said nucleic acid molecule is expressed. 

11. The method of claim 10, wherein said host cell is selected from the group 
consisting of prokaryotic host cells and eukaryotic host cells. 

12. An isolated polypeptide produced by the method of claim 10. 

13. An isolated polypeptide or protein selected from the group consistmg of a 
protein comprising the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, and 
a protein having at least about 75% ammo acid sequence identity with SEQ ID NO: 2, 4, 6, 
8, 10, 12, 14 or 16. 

14. An isolated antibody or antigen-binding antibody fragment that binds to a 
polypeptide of claim 13. 

15. An antibody of claim 14 wherein said antibody is a monoclonal or a 
polyclonal antibody. 

16. A method of identifying an agent which modulates the expression of a 
nucleic acid encoding a protein of claim 13, comprising: 
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exposing cells which e:^ress the nucleic acid to the agent; and 

determining whether the agent modulates expression of said nucleic acid, 
thereby identifying an agent which modulates the expression of a nucleic acid 
encoding the protein. 

17. A method of identifying an agent which modulates the level of or at least one 
activity of a protein of dahn 13, comprising: 

exposing cells which express the protein to the agent; 

determining whether the agent modulates the level of or at least one activity 
of said protein, thereby identifying an agent which modulates the level of or at least 
one activity of the protein. 

18. The method of claim 17, wherein the agent modulates one activity of the 

protein. 

19. A method of modulating the expression of a nucleic acid encoding a protein 
of claim 13, comprising: 

administering an effective amount of an agent which modulates the 
expression of a nucleic acid encoding the protein. 

20. A method of modulating at least one activity of a protein of claim 13, 
comprising: 

administering an effective amount of an agent which modulates at least one 
activity of the protein. 

21. A method of identifying binding partners for a protein of claim 13, 
comprising: 

exposing said protein to a potential binding partner; and 
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determining if the potential binding partner binds to said protein, thereby 
identifying binding partners for the protein. 

22. A method of identifying an agent which modulates the interaction between a 
binding partner of claim 21 and a protein of claim 13 , comprising: 

S exposuig said protein with s^d partner to the agent; and 

determining whether the agent modulates association of the binding partner 
with said protein, thereby identifying an agent which modulates association of a 
binding partner with said protein. 

23. A method of modulating the interaction between a binding partner of claim 
10 21 and a protein of claim 13, comprising: 

administering an effective amount of an agent which modulates association 
of a binding partner with said protein. 

24. A non-human transgenic animal modified to contain a nucleic acid molecule 
of any of claims 1-4. 

IS 25. The transgenic animal of claim 24, wherein the nucleic acid molecule 

contains a mutation that prevents expression of the encoded protein. 

26. A method of treating a disease state in a subject, comprising: 

inserting into a diseased cell a gene construct comprising an isolated nucleic 
acid molecule of any one of claims 1-4 linked to a promoter or enhancer element 
20 such that expression of said nucleic molecule causes suppression of said disease. 

27. The method of claim 26, wherein said inserting into a diseased cell is 
accomplished in vivo. 
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28. The method of cleum 26, wherein said inserting into a diseased cell further 
comprises use of a viral or plasmid agent and is accomplished either in vitro or in vivo. 

29. A method of diagnosing a disease state in a subject, comprising: 

determining the level of e;q)ression of a nucleic acid molecule or protein of 
5 any one of clmms 1-4 or 13. 

30. The method of claims 26 and 29, wherein the disease state is cancer. 

31. The method of claims 26 and 29, wherein die disease state is a malignant 
neoplasm. 

32. The method of claim 31, wherein the malignant neoplasm occurs in the 
10 breast, colon, esophagus, kidney, liver, lung, lymph node, ovary, pancreas, prostate, rectum, 

and/or stomach. 

33. A composition comprising a diluent and a polypeptide or protein selected 
from the group consisting of: an isolated polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16; an isolated polypeptide comprismg a fragment of 

15 at least 10 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16; an isolated polypeptide 
comprising conservative amino acid substitutions of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16;an isolated polypeptide comprising naturally occurring amino acid sequence variants of 
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16; and an isolated polypeptide exhibiting at least 
about 75% amino acid sequence identity with SEQ ID N0:2, 4, 6, 8, 10, 12, 14 or 16. 

20 
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Sequence Listing 



<110> LG Life Sciences Ltd. 

<120> Gene Famllilies Associated With Cancers 

<130> ItliGBPCTOOe 

<150> US60/419911 

<151> 2002-10-18 

<150> US60/419912 

<151> 2002-10-18 

<150> US60/420088 

<151> 2002-10-18 

<150> US60/434243 

<151> 2002-12-16 

<150> US60/434278 

<151> 2002-12-16 

<150> US60/438278 

<151> 2003-01-03 

<160> 56 

<170> Kopatentin 1.71 

<210> 1 

<211> 5293 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (390) . . (4880) 

<223> LBFL109 Clone A 



<400> 1 
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gtcctgtctg gcggtgccga cggtgagggg cggtggccca acggcgggag attcaaacct 60 

ggaagaagga ggaacatgga gaggagagca gcgggcccag gctgggcagc ctctggatcg 120 

aggcctgcct gaaggaggag cttccttccc cggtggagct ggaggagagc cttcggaatg 180 

gagtgctgct ggccaagctg ggccactgtt ttgcaccctc cgtggttccg ttgaagaaga 240 

tctacgatgt ggagcagctg cggtaccagg caactggctt acatttccgt cacacagaca 300 

acatcaactt ttggctatct gcaatagccc acatcggtct gccttcgacc ttcttcccag 360 

agaccacgga catctatgac aaaaagaac atg ccc egg gta gtc tac tgc ate 413 

Met Pro Arg Val Val Tyr Cys He 
1 5 

cat get etc agt etc tte etc ttc egg ctg gga ttg gee eet cag ata 461 
His Ala Leu Ser Leu Phe Leu Phe Arg Leu Gly Leu Ala Pro Gin He 
10 15 20 

eat gat eta tac ggg aaa gtg aaa ttc aea get gag gaa etc age aae 509 
His Asp Leu Tyr Gly Lys Val Lys Phe Thr Ala Glu Glu Leu Ser Asn 
25 30 35 40 

atg gcg tec gaa ctg gee aaa tat ggc etc cag ctg eet gee tte age 557 
Met Ala Ser Glu Leu Ala Lys Tyr Gly Leu Gin Leu Pro Ala Phe Ser 
45 50 55 

aag ate ggg ggc ate ttg gee aat gag etc teg gtg gat gag get gca 605 
Lys He Gly Gly Xle Leu Ala Asn Glu Leu Ser Val Asp Glu Ala Ala 
60 65 70 

gtc cat gca get gtt ett gee ate aat gaa gca gtg gag ega ggg gtg 653 
Val His Ala Ala Val Leu Ala He Asn Glu Ala Val Glu Arg Gly Val 
75 80 85 

gtg gag gac acc ctg get gee ttg cag aat ccc agt get ett ctg gag 701 
Val Glu Asp Thr Leu Ala Ala Leu Gin Asn Pro Ser Ala Leu Leu Glu 
90 95 100 



2/91 



wo 2004/035789 



Sequence Listing 



PCT/KR2003/002161 



aat etc cga gag cct ctg gca gcc gtc tac cag gag atg ctg gcc cag 
Asn Leu Arg Glu Pro Leu Ala Ala Val Tyr Gin Glu Met Leu Ala Gin 
105 110 115 120 



749 



gcc aag atg gag aag gca gcc aat gcc agg aac cat gat gac aga gaa 
Ala Lys Met Glu Lys Ala Ala Asn Ala Arg Asn His Asp Asp Arg Glu 
125 130 135 



797 



age cag gac ate tat gac cac tac eta act cag get gaa ate cag ggc 
Ser Gin Asp lie Tyr Asp His Tyr Leu Thr Gin Ala Glu lie Gin Gly 
140 145 150 



845 



aat ate aac cat gtc aac gtc cat ggg get eta gaa gtt gtt gat gat 
Asn lie Asn His Val Asn Val His Gly Ala Leu Glu Val Val Asp Asp 
155 160 165 



893 



gcc ctg gaa aga cag age cct gaa gee ttg etc aag gcc ett eaa gac 
Ala Leu Glu Arg Gin Ser Pro Glu Ala Leu Leu Lys Ala Leu Gin Asp 
170 175 180 



941 



cct gee ctg gee ctg ega ggg gtg agg aga gac ttt get gac tgg tac 
Pro Ala Leu Ala Leu Arg Gly Val Arg Arg Asp Phe Ala Asp Trp Tyr 
185 190 195 200 



989 



ctg gag cag ctg aac tea gac aga gag cag aag gca cag gag ctg ggc 
Leu Glu Gin Leu Asn Ser Asp Arg Glu Gin Lys Ala Gin Glu Leu Gly 
205 210 215 



1037 



ctg gtg gag ett ctg gaa aag gag gaa gtc cag get ggt gtg get gca 
Leu Val Glu Leu Leu Glu Lys Glu Glu Val Gin Ala Gly Val Ala Ala 
220 225 230 



1085 



gcc aac aca aag ggt gat cag gaa eaa gee atg etc cac get gtg cag 
Ala Asn Thr Lys Gly Asp Gin Glu Gin Ala Met Leu His Ala Val Gin 
235 240 245 



1133 



egg ate aac aaa gee ate egg agg gga gtg gcg get gac act gtg aag 
Arg Xle Asn Lys Ala lie TVrg Arg Gly Val Ala Ala Asp Thr Val Lys 
250 255 260 



1181 



3/91 



wo 2004/035789 



Sequence Listing 



PCT/KR2003/002161 



gag ctg atg tgc cct gag gcc cag ctg cct cca gtg tac cct gtt gca 
Glu Leu Met Cys Pro Glu Ala Gin Leu Pro Pro Val Tyr Pro Val Ala 
265 270 275 280 



1229 



teg tct atg tac cag ctg gag ctg gca gtg etc cag cag cag cag ggg 
Ser Ser Met Tyr Gin Leu Glu Leu Ala Val Leu Gin Gin Gin Gin Gly 
285 290 295 



1277 



gag ctt ggc cag gag gag etc ttc gtg get gtg gag atg etc tea get 
Glu Leu Gly Gin Glu Glu Leu Phe Val Ala Val Glu Met Leu Ser Ala 
300 305 310 



1325 



gtg gtc ctg att aac egg gcc ctg gag gcc egg gat gcc agt ggc ttc 
Val Val Leu lie Asn Arg Ala Leu Glu Ala Arg Asp Ala Ser Gly Phe 
315 320 325 



1373 



tgg age age ctg gtg aac cct gcc aca ggc ctg get gag gtg gaa gga 
Trp Ser Ser Leu Val Asn Pro Ala Thr Gly Leu Ala Glu Val Glu Gly 
330 335 340 



1421 



gaa aat gcc cag cgt tac ttc gat gee ctg ctg aaa ttg cga cag gag 
Glu Asn Ala Gin Arg Tyr Phe Asp Ala Leu Leu Lys Leu Arg Gin Glu 
345 350 355 360 



1469 



cgt ggg atg ggt gag gae ttc ctg age tgg aat gae ctg cag gcc ace 
Arg Gly Met Gly Glu Asp Phe Leu Ser Trp Asn Asp Leu Gin Ala Thr 
365 370 375 



1517 



gtg age cag gtc aat gca cag ace cag gaa gag act gae egg gtc ctt 
Val Ser Gin Val Asn Ala Gin Thr 61n Glu Glu Thr Asp Arg Val Leu 
380 385 390 



1565 



gca gtc age etc ate aat gag get ctg gae aaa ggc age cct gag aag 
Ala Val Ser Leu He Asn Glu Ala Leu Asp Lys Gly Ser Pro Glu Lys 
395 400 405 



1613 



act ctg tct gcc eta ctg ctt cct gca get ggc eta gat gat gtc age 
Thr Leu Ser Ala Leu Leu Leu Pro Ala Ala Gly Leu Asp Asp Val Ser 
410 415 420 



1661 
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etc cct gtc gcc cct egg tae cat etc etc ctt gtg gca gee aaa agg 
Leu Pro Val Ala Pro Arg Tyr His Leu Leu Leu Val TUa Ala Lys Arg 
425 430 435 440 



1709 



cag aag gee eag gtg aca ggg gat cct gga get gtg ctg tgg ett gag 
Gin Lys Ala Gin Val Thr Gly Asp Pro Gly Ala Val Leu Trp Leu Glu 
445 450 455 



1757 



gag ate ego cag gga gtg gtc aga gcc aac cag gac act aat aca get 
Glu lie Arg Gin Gly Val Val Arg Ala Asn Gin Asp Thr Asn Thr Ala 
460 465 470 



1805 



cag aga atg get ctt ggt gtg get gcc ate aat caa gcc ate aag gag 
Gin Arg Met Ala Leu Gly Val Ala Ala lie Asn Gin Ala He Lys Glu 
475 480 485 



1853 



ggc aag gca gcc cag act gag egg gtg ttg agg aac cec gca gtg gcc 
Gly Lys Ala Ala Gin Thr Glu Arg Val Leu Arg Asn Pro Ala Val Ala 
490 495 500 



1901 



ett ega ggg gta gtt cec gac tgt gee aac ggc tae eag cga gcc ctg 
Leu Arg Gly Val Val Pro Asp Cys Ala Asn Gly Tyr Gin Arg Ala Leu 
505 510 515 520 



1949 



gaa agt gee atg gca aag aaa cag cgt cea gca gac aca get ttc tgg 
Glu Ser Ala Met Ala Lys Lys Gin Arg Pro Ala Asp Thr Ala Phe Trp 
525 530 535 



1997 



gtt caa cat gac atg aag gat ggc act gcc tac tae ttc eat ctg cag 
Val Gin His Asp Met Lys Asp Gly Thr Ala Tyr Tyr Phe His Leu Gin 
540 545' 550 



2045 



ace ttc cag ggg ate tgg gag caa cct cct ggc tgc cec etc aac ace 
Thr Phe Gin Gly He Trp Glu Gin Pro Pro Gly Cys Pro Leu Asn Thr 
555 560 565 



2093 



tct cac ctg ace egg gag gag ate cag tea get gtc ace aag gtc act 
Ser His Leu Thr Arg Glu Glu He Gin Ser Ala Val Thr Lys Val Thr 
570 575 580 



2141 
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get gcc tat gac cgc caa cag etc tgg aaa gee aac gte ggc ttt gtt 
Ala Ala Tyr Asp TVrg Gin Gin Leu Trp Lys Ala Asn Val Gly Phe Val 
585 590 595 600 



2189 



ate cag etc cag gcc cgc etc cgt ggc ttc eta gtt egg cag aag ttt 
lie Gin Leu Gin Ala Arg Leu Arg Gly Phe Leu Val Arg Gin Lys Phe 
605 610 615 



2237 



get gag cat tec cae ttt ctg agg ace tgg etc cea gca gte ate aag 
Ala Glu His Ser His Phe Leu Arg Thr Trp Leu Pro Ala Val He Lys 
620 625 630 



2285 



ate cag get cat tgg egg ggt tat agg cag egg aag att tac ctg gag 
He Gin Ala His Trp Arg Gly Tyr Arg Gin Arg Lys He Tyr Leu Glu 
635 640 645 



2333 



tgg ttg cag tat ttt aaa gca aae ctg gat gee ata ate aag ate eag 
Trp Leu Gin Tyr Phe Lys Ala Asn Leu Asp Ala He He Lys He Gin 
650 655 660 



2381 



gee tgg gee egg atg tgg gca get egg agg caa tac ctg agg cgt ctg 
Ala Trp Ala Arg Met Trp Ala Ala Arg Arg Gin Tyr Leu Arg Arg Leu 
665 670 675 680 



2429 



cae tac ttc eag aag aat gtt aac tee att gtg aag ate cag gca ttt 
His Tyr Phe Gin Lys Asn Val Asn Ser He Val Lys He Gin Ala Phe 
685 690 695 



2477 



ttc cga gcc agg aaa gee caa gat gac tac agg ata tta gtg cat gca 
Phe Arg Ala Arg Lys Ala Gin Asp Asp Tyr Arg He Leu Val His Ala 
700 705 710 



2525 



cec cae cet ect etc agt gtg gta cgc aga ttt gee cat etc ttg aat 
Pro His Pro Pro Leu Ser Val Val Arg Arg Phe Ala His Leu Leu Asn 
715 720 725 



2573 



caa age eag caa gac ttc ttg get gag gca gag ctg ctg aag etc cag 
Gin Ser Gin Gin Asp Phe Leu Ala Glu Ala Glu Leu Leu Lys Leu Gin 
730 735 740 



2621 



6/91 



wo 2004/035789 



Sequence Listing 



PCT/KR2003/002161 



gaa gag gta gtt agg aag ate cga tec aat cag cag ctg gag cag gac 
Glu Glu Val Val Arg Lys lie Arg Ser Asn Gin Gin Leu Glu Gin Asp 
745 750 755 760 



2669 



etc aac ate atg gac ate aag att gge ctg ctg gtg aag aac egg ate 
Leu Asn lie Met Asp He Lys He Gly Leu Leu Val Lys Asn Arg He 
765 770 775 



2717 



act ctg cag gaa gtg gtc tec cac tgc aag aag ctg ace aag agg aat 
Thr Leu Gin Glu Val Val Ser His Cys Lys Lys Leu Thr Lys Arg Asn 
780 785 790 



2765 



aag gaa cag ctg tea gat atg atg gtt ctg gac aag cag aag ggt tta 
Lys Glu Gin Leu Ser Asp Met Met Val Leu Asp Lys Gin Lys Gly Leu 
795 800 805 



2813 



aag teg ctg age aaa gag aaa egg cag aaa eta gaa gca tac caa cac 
Lys Ser Leu Ser Lys Glu Lys Arg Gin Lys Leu Glu Ala Tyr Gin His 
810 815 820 



2861 



etc ttc tac ctg etc cag act cag ccc ate tac ctg gee aag ctg ate 
Leu Phe Tyr Leu Leu Gin Thr Gin Pro He Tyr Leu Ala Lys Leu He 
825 830 835 840 



2909 



ttt cag atg cca cag aac aaa acc acc aag ttc atg gag gca gtg att 
Phe Gin Met Pro Gin Asn Lys Thr Thr Lys Phe Met Glu Ala Val He 
845 850 855 



2957 



ttc age ctg tac aac tat gee tec age cge cga gag gee tat etc ctg 
Phe Ser Leu Tyr Asn Tyr Ala Ser Ser Arg Arg Glu Ala Tyr Leu Leu 
860 865 870 



3005 



etc cag ctg ttc aag aca gca etc cag gag gaa ate aag tea aag gtg 
Leu Gin Leu Phe Lys Thr Ala Leu Gin Glu Glu He Lys Ser Lys Val 
875 880 885 



3053 



gag cag ccc cag gac gtg gtg aca gge aac cca aca gtg gtg agg ctg 
Glu Gin Pro Gin Asp Val Val Thr Gly Asn Pro Thr Val Val Arg Leu 
890 895 900 



3101 
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gtg gtg aga ttc tac cgt aat ggg egg gga cag agt gcc ctg cag gag 
Val Val Arg Phe Tyr Arg Asn Gly Arg Gly Gin Ser Ala Leu Gin Glu 
905 910 915 920 



3149 



att ctg ggc aag gtt ate cag gat gtg eta gaa gac aaa gtg etc age 
lie Leu Gly Lys Val lie Gin Asp Val Leu Glu Asp Lys Val Leu Ser 
925 930 935 



3197 



gtc eac aca gac cct gtc cac etc tat aag aac tgg ate aac cag act 
Val His Thr Asp Pro Val His Leu Tyr Lys Asn Trp lie Asn Gin Thr 
940 945 950 



3245 



gag gcc cag aca ggg cag cgc age cat etc eca tat gat gtc acc ccg 
Glu Ala Gin Thr Gly Gin Arg Ser His Leu Pro Tyr Asp Val Thr Pro 
955 960 965 



3293 



gag cag gcc ttg age cac cec gag gtc cag aga cga ctg gac ate gcc 
Glu Gin Ala Leu Ser His Pro Glu Val Gin Arg Arg Leu Asp He Ala 
970 975 980 



3341 



eta cgc aac etc etc gee atg act gat aag ttc ctt tta gcc ate ace 
Leu Arg Asn Leu Leu Ala Met Thr Asp Lys Phe Leu Leu Ala He Thr 
985 990 995 1000 



3389 



tea tet gtg gac caa att ccg tat ggg atg cga tat gtg gee aaa gtc 
Ser Ser Val Asp Gin He Pro Tyr Gly Met Arg Tyr Val Ala Lys Val 
1005 1010 1015 



3437 



ctg aag gca act ctg gca gag aaa ttc cct gac gee aca gac age gag 
Leu Lys Ala Thr Leu Ala Glu Lys Phe Pro Asp Ala Thr Asp Ser Glu 
1020 1025 1030 



3485 



gtc tat aag gtg gtc ggg aac etc ctg tac tac cgc ttc ctg aac eca 
Val Tyr Lys Val Val Gly Asn Leu Leu Tyr Tyr Arg Phe Leu Asn Pro 
1035 1040 1045 



3533 



get gtg gtg get cct gac gee ttc gac att gtg gee atg gca get ggt 
Ala Val Val Ala Pro Asp Ala Phe Asp He Val Ala Met Ala Ala Gly 
1050 1055 1060 



3581 
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gga gcc ctg get gcc ccc cag cgc cat gcc ctg ggg get gtg get cag 
Gly Ala Leu Ala Ala Pro Gin Arg His Ala Leu Gly Ala Val Ala Gin 
1065 1070 1075 1080 



3629 



etc eta cag eac get gcg get gge aag gcc ttc tet ggg cag age cag 
Leu Leu Gin His Ala Ala Ala Gly Lys Ala Phe Ser Gly Gin Ser Gin 
1085 1090 1095 



3677 



eac eta egg gtc ctg aat gac tat ctg gag gaa aea eac etc aag ttc 
His Leu Arg Val Leu Asn Asp Tyr Leu Glu Glu Thr His Leu Lys Phe 
1100 1105 1110 



3725 



agg aag ttc ate cat aga gcc tgc cag gtg cca gag cca gag gag cgt 
Arg Lys Phe He His Arg Ala Cys Gin Val Pro Glu Pro Glu Glu Arg 
1115 1120 1125 



3773 



ttt gca gtg gac gag tac tea gac atg gtg get gtg gcc aaa ccc atg 
Phe Ala Val Asp Glu Tyr Ser Asp Met Val Ala Val Ala Lys Pro Met 
1130 1135 1140 



3821 



gtg tac ate acc gtg ggg gag ctg gtc aac acg cac agg ctg ttg ctg 
Val Tyr He Thr Val Gly Glu Leu Val Asn Thr His Arg Leu Leu Leu 
1145 1150 1155 1160 



3869 



gag cac cag gac tgc att gee cct gat cac caa gac ccc ctg cat gag 
Glu His Gin Asp Cys He Ala Pro Asp His Gin Asp Pro Leu His Glu 
1165 1170 1175 



3917 



etc ctg gag gat ett ggg gag ctg ccc ace ate cct gac ett att ggt 
Leu Leu Glu Asp Leu Gly Glu Leu Pro Thr He Pro Asp Leu He Gly 
1180 1185 1190 



3965 



gag age ate get gca gat ggg cac aea gac ctg age aag eta gaa gtg 
Glu Ser He Ala Ala Asp Gly His Thr Asp Leu Ser Lys Leu Glu Val 
1195 1200 1205 



4013 



tee ctg acg ctg ace aac aag ttt gaa gga eta gag gca gat get gat 
Ser Leu Thr Leu Thr Asn Lys Phe Glu Gly Leu Glu Ala Asp Ala Asp 
1210 1215 1220 



4061 
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gac tec aac acc cgt age ctg ctt ctg age ace aag cag ctg ttg gee 4109 
Asp Ser Asn Thr Arg Ser Leu Leu Leu Ser Thr Lys Gin Leu Leu Ala 
1225 1230 1235 1240 

gat ate ata cag ttc cat cct ggg gac acc etc aag gag ate ctg tec 4157 
Asp He He Gin Phe His Pro Gly Asp Thr Leu Lys Glu He Leu Ser 
1245 1250 1255 

etc teg get tec aga gag caa gaa gea gee cac aag cag ctg atg age 4205 
Leu Ser Ala Ser Arg Glu Gin Glu Ala Ala His Lys Gin Leu Met Ser 
1260 1265 1270 

ega cge cag gee tgt aca gee cag aca ceg gag eca ctg cga cga cac 4253 
Arg Arg Gin Ala Cys Thr Ala Gin Thr Pro Glu Pro Leu Arg Arg His 
1275 1280 1285 

cgc tea ctg aca get cac tec etc ctg eca ctg gea gag aag cag egg 4301 
Arg Ser Leu Thr Ala His Ser Leu Leu Pro Leu Ala Glu Lys Gin Arg 
1290 1295 1300 

cgc gte ctg egg aac ctg cgc cga ctt gaa gee ctg ggg ttg gtc age 4349 
Arg Val Leu Arg Asn Leu Arg Arg Leu Glu Ala Leu Gly Leu Val Ser 
1305 1310 1315 1320 

gcc aga aat ggc tac cag ggg eta gtg gac gag ctg gee aag gac ate 4397 
Ala Arg Asn Gly Tyr Gin Gly Leu Val Asp Glu Leu Ala Lys Asp He 
1325 1330 1335 

cgc aac cag cac aga cac agg cac agg egg aag gea gag ctg gtg aag 4445 
Arg Asn Gin His Arg His Arg His Arg Arg Lys Ala Glu Leu Val Lys 
1340 1345 1350 

ctg cag gee aca tta cag ggc ctg age act aag acc acc ttc tat gag 4493 
Leu Gin Ala Thr Leu Gin Gly Leu Ser Thr Lys Thr Thr Phe Tyr Glu 
1355 1360 1365 

gag cag ggt gac tac tac age cag tac ate egg gcc tge ctg gac cac 4541 
Glu Gin Gly Asp Tyr Tyr Ser Gin Tyr He Arg Ala Cys Leu Asp His 
1370 1375 1380 
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ctg gcc ccc gac tec aag agt tct ggg aag ggg aag aag cag cct tct 4589 
Leu Ala Pro Asp Ser Lys Ser Ser Gly Lys Gly Lys Lys Gin Pro Ser 
1385 1390 1395 1400 

ctt cat tac act get get cag etc ctg gaa aag ggt gtc ttg gtg gaa 4637 
Leu His Tyr Thr Ala Ala Gin Leu Leu Glu Lys Gly Val Leu Val Glu 
1405 1410 1415 

att gaa gat ctt ccc gcc tct cac ttc aga aac gtc ate ttt gac ate 4685 
lie Glu Asp Leu Pro Ala Ser His Phe Arg Asn Val lie Phe Asp He 
1420 1425 1430 

acg ccg gga gat gag gca gga aag ttt gaa gta aat gcc aag ttc ctg 4733 
Thr Pro Gly TVsp Glu Ala Gly Lys Phe Glu Val Asn Ala Lys Phe Leu 
1435 1440 1445 

ggt gtg gac atg gag cga ttt cag ctt cac tat cag gat etc ctg cag 4781 
Gly Val Asp Met Glu Arg Phe Gin Leu His Tyr Gin Asp Leu Leu Gin 
1450 1455 1460 

etc cag tat gag ggt gtg get gtc atg aaa etc ttc aac aag gee aaa 4829 
Leu Gin Tyr Glu Gly Val Ala Val Met Lys Leu Phe Asn Lys Ala Lys 
1465 1470 1475 1480 

gtc aat gtc aac ctt etc ate ttc etc etc aac aag aag ttt ttg egg 4877 
Val Asn Val Asn Leu Leu He Phe Leu Leu han Lys Lys Phe Leu Arg 
1485 1490 1495 

aag tgacagaggc aaagggtgct acccaagccc etcttaccte tctggatgct 4930 

Lys 

ttetttaaea etaaeteaec aetgtgettc ectgcagaea cecagagetc aggactgggc 4990 

aaggcecagg gattctcace ecttccecag ctgggaggag cttgectgee tggccacaga 5050 

cagtgtatct tctaattggc taaagtggge ettgeccaga gtccagctgt gtggetttta 5110 

teatgeatga caaacecetg gctttcctge eagatggatt etcatccett acagetgact 5170 

cttecaggca atttccatag atctgcagte etgcetctgc eacagtetet etgttgtcec 5230 
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cacatctacc caacttcctg tactgttgcc cttctgatgt taataaaagc agctgttact 5290 
ccc 5293 



<210> 2 

<211> 1497 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Pro Arg Val Val Tyr Cys lie His Ala Leu Ser Leu Phe Leu Phe 
15 10 15 

Arg Leu Gly Leu Ala Pro Gin He His Asp Leu Tyr Gly Lys Val Lys 
20 25 30 

Phe Thr Ala Glu Glu Leu Ser Asn Met Ala Ser Glu Leu Ala Lys Tyr 
35 40 45 

Gly Leu Gin Leu Pro Ala Phe Ser Lys He Gly Gly He Leu Ala Asn 
50 55 60 

Glu Leu Ser Val Asp Glu Ala Ala Val His Ala Ala Val Leu Ala He 
65 70 75 80 

Asn Glu Ala Val Glu Arg Gly Val Val Glu Asp Thr Leu Ma Ala Leu 
85 90 95 

Gin Asn Pro Ser Ala Leu Leu Glu Asn Leu Arg Glu Pro Leu Ala Ala 
100 105 110 

Val Tyr Gin Glu Met Leu Ala Gin Ala Lys Met Glu Lys Ala Ala Asn 
115 120 125 

Ala Arg Asn His Asp Asp Arg Glu Ser Gin Asp He Tyr Asp His Tyr 
130 135 140 

Leu Thr Gin Ala Glu He Gin Gly Asn He Asn His Val Asn Val His 
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145 150 155 160 

Gly Ala Leu Glu Val Val Asp Asp Ala Leu Glu Arg Gin Ser Pro Glu 
165 170 175 

Ala Leu Leu Lys Ala Leu Gin Asp Pro Ala Leu Ala Leu Arg Gly Val 
180 185 190 

Arg Arg Asp Phe Ala Asp Trp Tyr Leu Glu Gin Leu Asn Ser Asp Arg 
195 200 205 

Glu Gin Lys Ala Gin Glu Leu Gly Leu Val Glu Leu Leu Glu Lys Glu 
210 215 220 

Glu Val Gin Ala Gly Val Ala Ala Ala Asn Thr Lys Gly Asp Gin Glu 
225 230 235 240 

Gin Ala Met Leu His Ala Val Gin Arg lie Asn Lys Ala lie Arg Arg 
245 250 255 

Gly Val Ala Ala Asp Thr Val Lys Glu Leu Met Cys Pro Glu Ala Gin 
260 265 270 

Leu Pro Pro Val Tyr Pro Val Ala Ser Ser Met Tyr Gin Leu Glu Leu 
275 280 285 

Ala Val Leu Gin Gin Gin Gin Gly Glu Leu Gly Gin Glu Glu Leu Phe 
290 295 300 

Val Ala Val Glu Met Leu Ser Ala Val Val Leu lie Asn Arg Ala Leu 
305 310 315 320 

Glu Ala Arg Asp Ala Ser Gly Phe Trp Ser Ser Leu Val Asn Pro Ala 
325 330 335 

Thr Gly Leu Ala Glu Val Glu Gly Glu Asn Ala Gin Arg Tyr Phe Asp 
340 345 350 

Ala Leu Leu Lys Leu Arg Gin Glu Arg Gly Met Gly Glu Asp Phe Leu 
355 360 365 
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Ser Trp Asn Asp Leu Gin Ala Thr Val Ser Gin Val Asn Ala Gin Thr 
370 375 380 

Gin Glu Glu Thr Asp Arg Val Leu Ala Val Ser Leu lie Asn Glu Ala 
385 390 395 400 

Leu Asp Lys Gly Ser Pro Glu Lys Thr Leu Ser Ala Leu Leu Leu Pro 
405 410 415 

Ala Ala Gly Leu Asp Asp Val Ser Leu Pro Val Ala Pro Arg Tyr His 
420 425 430 

Leu Leu Leu Val Ala Ala Lys Arg Gin Lys Ala Gin Val Thr Gly Asp 
435 440 445 

Pro Gly Ala Val Leu Trp Leu Glu Glu lie Arg Gin Gly Val Val Arg 
450 455 460 

Ala Asn Gin Asp Thr Asn Thr Ala Gin Arg Met Ala Leu Gly Val Ala 
465 470 475 480 

Ala lie Asn Gin Ala He Lys Glu Gly Lys Ala Ala Gin Thr Glu Arg 
485 490 495 

Val Leu Arg Asn Pro Ala Val TULa Leu Arg Gly Val Val Pro Asp Cys 
500 505 510 

Ala Asn Gly Tyr Gin Arg Ala Leu Glu Ser Ala Met Ala Lys Lys Gin 
515 520 525 

Arg Pro Ala Asp Thr Ala Phe Trp Val Gin His Asp Met Lys Asp Gly 
530 535 540 

Thr Ala Tyr Tyr Phe His Leu Gin Thr Phe Gin Gly He Trp Glu Gin 
545 550 555 560 

Pro Pro Gly Cys Pro Leu Asn Thr Ser His Leu Thr Arg Glu Glu He 
565 570 575 
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Gin Ser Ala Val Thr Lys Val Thr Ala Ala Tyr Asp Arg Gin Gin Leu 
580 585 590 

Trp Lys Ala Asn Val Gly Phe Val lie Gin Leu Gin Ala Arg Leu Arg 
595 600 605 

Gly Phe Leu Val Arg Gin Lys .Phe Ala Glu His Ser His Phe Leu Arg 
610 615 620 

Thr Trp Leu Pro Ala Val He Lys He Gin Ala His Trp Arg Gly Tyr 
625 630 635 640 

Arg Gin Arg Lys He Tyr Leu Glu Trp Leu Gin Tyr Phe Lys Ala Asn 
645 650 655 

Leu Asp Ala He He Lys He Gin Ala Trp Ala Arg Met Trp Ala Ala 
660 665 670 

Arg Arg Gin Tyr Leu Arg Arg Leu His Tyr Phe Gin Lys Asn Val Asn 
675 680 685 

Ser He Val Lys He Gin Ala Phe Phe Arg Ala Arg Lys Ala Gin 'Asp 
690 695 700 

Asp Tyr Arg He Leu Val His Ala Pro His Pro Pro Leu Ser Val Val 
705 710 715 720 

Arg Arg Phe Ala His Leu Leu Asn Gin Ser Gin Gin Asp Phe Leu Ala 
725 730 735 

Glu Ala Glu Leu Leu Lys Leu Gin Glu Glu Val Val Arg Lys He Arg 
740 745 750 

Ser Asn Gin Gin Leu Glu Gin Asp Leu Asn He Met Asp He Lys He 
755 760 765 

Gly Leu Leu Val Lys Asn Arg He Thr Leu Gin Glu Val Val Ser His 
770 775 780 

Cys Lys Lys Leu Thr Lys Arg Asn Lys Glu Gin Leu Ser Asp Met Met 
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785 790 795 800 

Val Leu Asp Lys Gin Lys Gly Leu Lys Ser Leu Ser Lys Glu Lys Arg 
805 810 815 

Gin Lys Leu Glu Ala Tyr Gin His Leu Phe Tyr Leu Leu Gin Thr Gin 
820 825 830 

Pro lie Tyr Leu Ala Lys Leu lie Phe Gin Met Pro Gin Asn Lys Thr 
835 840 845 

Thr Lys Phe Met Glu Ala Val lie Phe Ser Leu Tyr Asn Tyr Ala Ser 
850 855 860 

Ser Arg Arg Glu Ala Tyr Leu Leu Leu Gin Leu Phe Lys Thr Ala Leu 
865 870 875 880 

Gin Glu Glu He Lys Ser Lys Val Glu Gin Pro Gin Asp Val Val Thr 
885 890 895 

Gly Asn Pro Thr Val Val Arg Leu Val Val Arg Phe Tyr Arg Asn Gly 
900 905 910 

Arg Gly Gin Ser Ala Leu Gin Glu He Leu Gly Lys Val He Gin Asp 
915 920 925 

Val Leu Glu Asp Lys Val Leu Ser Val His Thr Asp Pro Val His Leu 
930 935 940 

Tyr Lys Asn Trp He Asn Gin Thr Glu Ala Gin Thr Gly Gin Arg Ser 
945 950 955 960 

His Leu Pro Tyr TVsp Val Thr Pro Glu Gin Ala Leu Ser His Pro Glu 
965 970 975 

Val Gin Arg Arg Leu Asp He Ala Leu Arg Asn Leu Leu Ala Met Thr 
980 985 990 

Asp Lys Phe Leu Leu Ala He Thr Ser Ser Val Asp Gin He Pro Tyr 
995 1000 1005 
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Gly Met Arg Tyr Val Ala Lys Val Leu Lys Ala Thr Leu Ala Glu Lys 
1010 1015 1020 

Phe Pro Asp Ala Thr Asp Ser Glu Val Tyr Lys Val Val Gly Asn Leu 
1025 1030 1035 1040 

Leu Tyr Tyr Arg Phe Leu Asn Pro Ala Val Val Ala Pro Asp Ala Phe 
1045 1050 1055 

Asp lie Val Ala Met Ala Ala Gly Gly Ala Leu Ala Ala Pro Gin Arg 
1060 1065 1070 

His Ala Leu Gly Ala Val Ala Gin Leu Leu Gin His Ala Ala 2U.a Gly 
1075 1080 1085 

Lys Ala Phe Ser Gly Gin Ser Gin His Leu Arg Val Leu Asn Asp Tyr 
1090 1095 1100 

Leu Glu Glu Thr His Leu Lys Phe Arg Lys Phe lie His Arg Ala Cys 
1105 1110 1115 1120 

Gin Val Pro Glu Pro Glu Glu Arg Phe Ala Val Asp Glu Tyr Ser Asp 
1125 1130 1135 

Met Val Ala Val Ala Lys Pro Met Val Tyr lie Thr Val Gly Glu Leu 
1140 1145 1150 

Val Asn Thr His Arg Leu Leu Leu Glu His Gin Asp Cys He Ala Pro 
1155 1160 1165 

Asp His Gin Asp Pro Leu His Glu Leu Leu Glu Asp Leu Gly Glu Leu 
1170 1175 1180 

Pro Thr He Pro Asp Leu He Gly Glu Ser He Ala Ala Asp Gly His 
1185 1190 1195 1200 

Thr Asp Leu Ser Lys Leu Glu Val Ser Leu Thr Leu Thr Asn Lys Phe 
1205 1210 1215 
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Glu Gly lieu Glu Ala Asp Ala Asp Asp Ser Asn Thr Arg Ser Leu Leu 
1220 1225 1230 

Leu Ser Thr Lys Gin Leu Leu Ala Asp lie lie Gin Phe His Pro Gly 
1235 1240 1245 

Asp Thr Leu Lys Glu lie Leu Ser Leu .Ser Ala Ser Arg Glu Gin Glu 
1250 1255 1260 

Ala Ala His Lys Gin Leu Met Ser Arg Arg Gin Ala Cys Thr Ala Gin 
1265 1270 1275 1280 

Thr Pro Glu Pro Leu Arg Arg His Arg Ser Leu Thr Ala His Ser Leu 
1285 1290 1295 

Leu Pro Leu Ala Glu Lys Gin Arg Arg Val Leu Arg Asn Leu Arg Arg 
1300 1305 1310 

Leu Glu Ala Leu Gly Leu Val Ser Ala Arg Asn Gly Tyr Gin Gly Leu 
1315 1320 1325 

Val Asp Glu Leu Ala Lys Asp lie Arg Asn Gin His Arg His Arg His 
1330 1335 1340 

Arg Arg Lys Ala Glu Leu Val Lys Leu Gin Ala Thr Leu Gin Gly Leu 
1345 1350 1355 1360 

Ser Thr Lys Thr Thr Phe Tyr Glu Glu Gin Gly Asp Tyr Tyr Ser Gin 
1365 1370 1375 

Tyr lie Arg Ala Cys Leu Asp His Leu Ala Pro Asp Ser Lys Ser Ser 
1380 1385 1390 

Gly Lys Gly Lys Lys Gin Pro Ser Leu His Tyr Thr Ala Ala Gin Leu 
1395 1400 1405 

Leu Glu Lys Gly Val Leu Val Glu lie Glu Asp Leu Pro Ala Ser His 
1410 1415 1420 

Phe Arg Asn Val lie Phe Asp lie Thr Pro Gly Asp Glu Ala Gly Lys 
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1425 



1430 



1435 



1440 



Phe Glu Val Asn Ala Lys Phe Leu Gly Val Asp Met Glu Arg Phe Gin 
1445 1450 1455 

Leu His Tyr Gin Asp Leu Leu Gin Leu Gin Tyr Glu Gly Val Ala Val 
1460 14 65 1470 

Met Lys Leu Phe Asn Lys Ala Lys Val Asn Val Asn Leu Leu lie Phe 
1475 1480 1485 

Leu Leu Asn Lys Lys Phe Leu Arg Lys 
1490 1495 



<210> 
<211> 
<212> 
<213> 



3 

5317 
DNA 

Homo sapiens 



<220> 
<221> 
<222> 
<223> 



CDS 

(12) . . (4904) 
LBFL109 Clone B 



<400> 3 
gaaggaggaa c 



atg gag agg aga gca gcg ggc cca ggc tgg gca 
Met Glu Arg Arg Ala Ala Gly Pro Gly Trp Ala 
15 10 



44 



gcc tat gaa cgc etc aca get gag gag atg gat gag cag agg egg cag 
Ala Tyr Glu Arg Leu Thr Ala Glu Glu Met Asp Glu Gin Arg Arg Gin 
15 20 25 



92 



aat gtt gcc tat cag tac ctg tgc egg ctg gag gag gcc aag cgc tgg 
Asn Val Ala Tyr Gin Tyr Leu Cys Arg Leu Glu Glu Ala Lys Arg Trp 
30 35 40 



140 



atg gag gcc tgc ctg aag gag gag ctt cct tec ccg gtg gag ctg gag 



188 
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Met Glu Ala Cys Leu Lys Glu Glu Leu Pro Ser Pro Val Glu Leu Glu 
45 50 55 

gag age ctt egg aat gga gtg etg ctg gee aag ctg gge cac tgt ttt 236 
Glu Ser Leu Arg Asn Gly Val Leu Leu Ala Lys Leu Gly His Cys Phe 
60 65 70 75 

gca ece tee gtg gtt cog ttg aag aag ate tac gat gtg gag cag ctg 284 
Ala Pro Ser Val Val Pro Leu Lys Lys lie Tyr Asp Val Glu Gin Leu 
80 85 90 

egg tac cag gca act ggc tta cat ttc cgt cac aca gac aac ate aac 332 
Arg Tyr Gin Ala Thr Gly Leu His Phe Arg His Thr Asp Asn He Asn 
95 100 105 

ttt tgg eta tet gca ata gee cac ate ggt ctg cct teg ace ttc ttc 380 
Phe Trp Leu Ser Ala He Ala His He Gly Leu Pro Ser Thr Phe Phe 
110 115 120 

cea gag acc aeg gae ate tat gac aaa aag aae atg ece egg gta gtc 428 
Pro Glu Thr Thr Asp He Tyr Asp Lys Lys Asn Met Pro Arg Val Val 
125 130 135 

tac tgc ate eat get etc agt etc ttc etc ttc egg etg gga ttg gee 476 
Tyr Cys He His Ala Leu Ser Leu Phe Leu Phe Arg Leu Gly Leu Ala 
140 145 150 155 

cct cag ata cat gat eta tac ggg aaa gtg aaa ttc aca get gag gaa 524 
Pro Gin He His Asp Leu Tyr Gly Lys Val Lys Phe Thr Ala Glu Glu 
160 165 170 

etc age aae atg geg tee gaa etg gee aaa tat ggc etc cag ctg cct 572 
Leu Ser Asn Met Ala Ser Glu Leu Ala Lys Tyr Gly Leu Gin Leu Pro 
175 180 185 

gee ttc age aag ate ggg ggc ate ttg gee aat gag etc teg gtg gat 620 
Ala Phe Ser Lys He Gly Gly He Leu Ala Asn Glu Leu Ser Val Asp 
190 195 200 

jag get gca gtc cat gca get gtt ctt gee ate aat gaa gca gtg gag 668 
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Glu Ala Ala Val His Ala Ala Val Leu Ala He Asn Glu Ala Val Glu 
205 210 215 



PCT/KR2003/002161 



cga ggg gtg gtg gag gac acc ctg get gcc ttg cag aat ccc agt get 
Arg Gly Val Val Glu Asp Thr Leu Ala Ala Leu Gin Asn Pro Ser Ala 
220 225 230 235 



716 



ctt ctg gag aat etc cga gag eet ctg gca gee gtc tac cag gag atg 
Leu Leu Glu Asn Leu Arg Glu Pro Leu Ala Ala Val Tyr Gin Glu Met 
240 245 250 



764 



ctg gee cag gee aag atg gag aag gca gcc aat gee agg aac eat gat 
Leu Ala Gin Ala Lys Met Glu Lys Ala Ala Asn Ala Arg Asn His Asp 
255 260 265 



812 



gac aga gaa age cag gac ate tat gac eac tac eta act cag get gaa 
Asp Arg Glu Ser Gin Asp He Tyr Asp His Tyr Leu Thr Gin Ala Glu 
270 275 280 



860 



ate cag ggc aat ate aae cat gtc aac gtc cat ggg get eta gaa gtt 
He Gin Gly Asn He Asn His Val Asn Val His Gly Ala Leu Glu Val 
285 290 295 



908 



gtt gat gat gee ctg gaa aga cag age eet gaa gee ttg etc aag gcc 
Val Asp Asp Ala Leu Glu Arg Gin Ser Pro Glu Ala Leu Leu Lys Ala 
300 305 310 315 



956 



ctt eaa gac eet gee ctg gee ctg cga ggg gtg agg aga gac ttt get 
Leu Gin TVsp Pro Ala Leu Ala Leu Arg Gly Val Arg Arg Asp Phe Ala 
320 325 330 



1004 



gac tgg tac ctg gag cag ctg aac tea gac aga gag cag aag gca cag 
Asp Trp Tyr Leu Glu Gin Leu Asn Ser Asp Arg Glu Gin Lys Ala Gin 
335 340 345 



1052 



gag ctg ggc ctg gtg gag ctt ctg gaa aag gag gaa gtc cag get ggt 

Glu Leu Gly Leu Val Glu Leu Leu Glu Lys Glu Glu Val Gin Ala Gly 

350 355 360 

gtg get gca gcc aac aca aag ggt gat cag gaa caa gcc atg etc cac 



1100 



1148 
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VaX Ala Ala Ala Asn Thr Lys Gly Asp Gin Glu Gin Ala Met Leu His 
365 370 375 



PCT/KR2003/002161 



get gtg cag egg ate aac aaa gee ate egg agg gga gtg gcg get gac 
Ala Val Gin Arg He Asn Lys Ala He Arg Arg Gly Val Ala Ala Asp 
380 385 390 395 



1196 



act gtg aag gag ctg atg tgc cot gag gcc cag ctg cct cca gtg tac 
Thr Val Lys Glu Leu Met Cys Pro Glu Ala Gin Leu Pro Pro Val Tyr 
400 405 410 



1244 



cct gtt gca teg tct atg tac cag ctg gag ctg gca gtg etc cag cag 
Pro Val Ala Ser Ser Met Tyr Gin Leu Glu Leu Ala Val Leu Gin Gin 
415 420 425 



1292 



cag cag ggg gag ctt ggc cag gag gag etc ttc gtg get gtg gag atg 
Gin Gin Gly Glu Leu Gly Gin Glu Glu Leu Phe Val Ala Val Glu Met 
430 435 440 



1340 



etc tea get gtg gte ctg att aae egg gee ctg gag gee egg gat gee 
Leu Ser Ala Val Val Leu He Asn Arg Ala Leu Glu Ala Arg Asp Ala 
445 450 455 



1388 



agt ggc ttc tgg age age ctg gtg aac cct gee aca ggc ctg get gag 
Ser Gly Phe Trp Ser Ser Leu Val Asn Pro Ala Thr Gly Leu Ala Glu 
460 465 470 475 



1436 



gtg gaa gga gaa aat gee cag cgt tac ttc gat gee ctg ctg aaa ttg 
Val Glu Gly Glu Asn Ala Gin Arg Tyr Phe Asp Ala Leu Leu Lys Leu 
480 485 490 



1484 



ega cag gag cgt ggg atg ggt gag gac ttc ctg age tgg aat gac ctg 
Arg Gin Glu Arg Gly Met Gly Glu Asp Phe Leu Ser Trp Asn Asp Leu 
495 500 505 



1532 



cag gcc acc gtg age cag gtc aat gca cag acc cag gaa gag act gac 
Sin Ala Thr Val Ser Gin Val Asn Ala Gin Thr Gin Glu Glu Thr Asp 
510 515 520 



1580 



=99 gtc ctt gca gtc ago etc ate aat gag get ctg gac aaa ggc age 



1628 



22/91 



wo 2004/035789 



Sequence Listing 



Arg Val Leu Ala Val Ser Leu He Asn Glu Ala Leu Asp Lys Gly Ser 
525 530 535 



PCT/KR2003/002161 



cct gaQ aag act ctg tct gcc eta ctg ctt cct gca get gge eta gat 
Pro Glu Lys Thr Leu Ser Ala Leu Leu Leu Pro Ala Ala Gly Leu Asp 
540 545 550 555 



1676 



gat gte age cte eet gte gee cet egg tac eat ete ete ett gtg gca 
Asp Val Ser Leu Pro Val Ala Pro Arg Tyr His Leu Leu Leu Val Ala 
560 565 570 



1724 



gcc aaa agg cag aag gcc cag gtg aea ggg gat eet gga get gtg ctg 
Ala Lys Arg Gin Lys Ala Gin Val Thr Gly Asp Pro Gly Ala Val Leu 
575 580 585 



1772 



tgg ctt gag gag ate ege cag gga gtg gte aga gcc aac cag gae act 
Trp Leu Glu Glu He Arg Gin Gly Val Val Arg Ala Asn Gin Asp Thr 
590 595 600 



1820 



aat aea get cag aga atg get ett ggt gtg get gee ate aat caa gee 
Asn Thr Ala Gin Arg Met Ala Leu Gly Val Ala Ala He Asn Gin Ala 
605 610 615 



1868 



ate aag gag ggc aag gca gee cag act gag egg gtg ttg agg aac cec 
He Lys Glu Gly Lys Ala Ala Gin Thr Glu Arg Val Leu Arg Asn Pro 
620 625 630 635 



1916 



gca gtg gcc ctt ega ggg gta gtt cec gac tgt gcc aac ggc tac cag 
Ala Val Ala Leu Arg Gly Val Val Pro Asp Cys Ala Asn Gly Tyr Gin 
640 645 650 



1964 



ega gcc ctg gaa agt gee atg gca aag aaa cag egt eca gca gac aea 
Arg Ala Leu Glu Ser Ala Met Ala Lys Lys Gin Arg Pro Ala Asp Thr 
655 660 665 



2012 



get tte tgg gtt eaa eat gae atg aag gat gge aet gee tae tac tte 
Ala Phe Trp Val Gin His Asp Met Lys Asp Gly Thr Ala Tyr Tyr Phe 
670 675 680 



2060 



cat ctg eag ace tte eag ggg ate tgg gag eaa cct eet ggc tgc cec 



2108 



23/91 
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Sequence Listing 



His Leu Gin Thr Phe Gin Gly lie Trp Glu Gin Pro Pro Gly Cys Pro 
685 690 695 



PCT/KR2003/002161 



etc aac acc tct cac ctg acc egg gag gag ate cag tea get gtc ace 
Leu Asn Thr Ser His Leu Thr Arg Glu Glu lie Gin Ser Ala Val Thr 
700 705 710 715 



2156 



aag gtc act get gee tat gac cgc caa cag etc tgg aaa gcc aac gtc 
Lys Val Thr Ala Ala Tyr Asp Arg Gin Gin Leu Trp Lys Ala Asn Val 
720 725 730 



2204 



ggc ttt gtt ate cag etc cag gcc cgc etc cgt ggc ttc eta gtt egg 
Gly Phe Val He Gin Leu Gin Ala Arg Leu Arg Gly Phe Leu Val Arg 
735 740 745 



2252 



cag aag ttt get gag cat tee eac ttt ctg agg ace tgg etc cca gea 
Gin Lys Phe Ala Glu His Ser His Phe Leu Arg Thr Trp Leu Pro Ala 
750 755 760 



2300 



gtc ate aag ate cag get cat tgg egg ggt tat agg cag egg aag att 
Val He Lys He Gin Ala His Trp Arg Gly Tyr Arg Gin Arg Lys He 
765 770 775 



2348 



tac ctg gag tgg ttg cag tat ttt aaa gea aac ctg gat gee ata ate 
Tyr Leu Glu Trp Leu Gin Tyr Phe Lys Ala Asn Leu Asp Ala He He 
780 785 790 795 



2396 



aag ate cag gee tgg gcc egg atg tgg gea get egg agg caa tac ctg 
Lys He Gin Ala Trp Ala Arg Met Trp Ala Ala Arg Arg Gin Tyr Leu 
800 805 810 



2444 



agg cgt ctg cac tac ttc cag aag aat gtt aac tec att gtg aag ate 
Arg Arg Leu His Tyr Phe Gin Lys Asn Val Asn Ser He Val Lys He 
815 820 825 



2492 



cag gea ttt ttc ega gcc agg aaa gcc caa gat gac tac agg ata tta 
Gin Ala Phe Phe Arg Ala Arg Lys Ala Gin Asp Asp Tyr Arg He Leu 
830 835 840 



2540 



gtg cat gea ccc cac cct ect etc agt gtg gta cgc aga ttt gcc cat 



2588 
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Val His Ala Pro His Pro Pro Leu Ser Val Val Arg Arg Phe Ala His 
845 850 855 



PCT/KR2003/002161 



etc ttg aat caa age cag caa gac ttc ttg get gag gca gag ctg ctg 
Leu Leu Asn Gin Ser Gin Gin Asp Phe Leu Ala Glu Ala Glu Leu Leu 
860 865 870 875 



2636 



aag etc cag gaa gag gta gtt agg aag ate cga tec aat cag cag ctg 
Lys Leu Gin Glu Glu Val Val Arg Lys lie Arg Ser Asn Gin Gin Leu 
880 885 890 



2684 



gag cag gac etc aac ate atg gac ate aag att ggc ctg ctg gtg aag 
Glu Gin Asp Leu Asn lie Met Asp He Lys He Gly Leu Leu Val Lys 
895 900 905 



2732 



aac egg ate act ctg cag gaa gtg gte tec cae tge aag aag ctg ace 
Asn Arg He Thr Leu Gin Glu Val Val Ser His Cys Lys Lys Leu Thr 
910 915 920 



2780 



aag agg aat aag gaa cag ctg tea gat atg atg gtt ctg gac aag cag 
Lys Arg Asn Lys Glu Gin Leu Ser Asp Met Met Val Leu Asp Lys Gin 
925 930 935 



2828 



aag ggt tta aag teg ctg age aaa gag aaa egg cag aaa eta gaa gca 
Lys Gly Leu Lys Ser Leu Ser Lys Glu Lys Arg Gin Lys Leu Glu Ala 
940 945 950 955 



2876 



tae caa cae etc ttc tac ctg etc cag act cag cec ate tac ctg gee 
Tyr Gin His Leu Phe Tyr Leu Leu Gin Thr Gin Pro He Tyr Leu Ala 
960 965 970 



2924 



aag ctg ate ttt cag atg eca cag aac aaa ace ace aag ttc atg gag 
Lys Leu He Phe Gin Met Pro Gin Asn Lys Thr Thr Lys Phe Met Glu 
975 980 985 



2972 



gca gtg att ttc age ctg tac aac tat gee tee age cge cga gag gee 
Ala Val He Phe Ser Leu Tyr Asn Tyr Ala Ser Ser Arg Arg Glu Ala 
990 995 1000 



3020 



tat etc ctg etc cag ctg ttc aag aca gca etc cag gag gaa ate aag 



3068 



25/91 
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Tyr Leu Leu Leu Gin Leu Phe Lys Thr Ala Leu Gin Glu Glu lie Lys 
1005 1010 1015 



PCT/KR2003/002161 



tea aag gtg gag cag ccc cag gac gtg gtg aca ggc aac cca aca gtg 
Ser Lys Val Glu Gin Pro Gin Asp Val Val Thr Gly Asn Pro Thr Val 
1020 1025 1030 1035 



3116 



gtg agg ctg gtg gtg aga ttc tac cgt aat ggg egg gga cag agt gcc 
Val Arg Leu Val Val Arg Phe Tyr Arg Asn Gly Arg Gly Gin Ser Ala 
1040 1045 1050 



3164 



ctg cag gag att ctg ggc aag gtt ate cag gat gtg eta gaa gac aaa 
Leu Gin Glu lie Leu Gly Lys Val lie Gin Asp Val Leu Glu Asp Lys 
1055 1060 1065 



3212 



gtg etc age gtc cac aca gac cet gtc eac etc tat aag aac tgg ate 
Val Leu Ser Val His Thr Asp Pro Val His Leu Tyr Lys Asn Trp lie 
1070 1075 1080 



3260 



aac cag act gag gcc cag aca ggg cag cgc age cat etc cca tat gat 
Asn Gin Thr Glu Ala Gin Thr Gly Gin Arg Ser His Leu Pro Tyr Asp 
1085 1090 1095 



3308 



gtc acc ccg gag cag gee ttg age eac ccc gag gtc cag aga ega ctg 
Val Thr Pro Glu Gin Ala Leu Ser His Pro Glu Val Gin Arg Arg Leu 
1100 1105 1110 1115 



3356 



gac ate gcc eta cgc aac etc etc gee atg act gat aag ttc ctt tta 
Asp lie Ala Leu Arg Asn Leu Leu Ala Met Thr Asp Lys Phe Leu Leu 
1120 1125 1130 



3404 



gcc ate acc tea tet gtg gac eaa att ccg tat ggg atg cga tat gtg 
Ala lie Thr Ser Ser Val Asp Gin lie Pro Tyr Gly Met Arg Tyr Val 
1135 1140 1145 



3452 



gcc aaa gtc ctg aag gca act ctg gea gag aaa ttc cet gac gcc aca 
Ala Lys Val Leu Lys Ala Thr Leu Ala Glu Lys Phe Pro Asp Ala Thr 
1150 1155 1160 



3500 



gac age gag gtc tat aag gtg gtc ggg aac etc ctg tac tac cgc ttc 



3548 
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Asp Ser Glu Val Tyr Lys Val Val Gly Asn Leu Leu Tyr Tyr Arg Phe 
1165 1170 1175 



ctg aac cca get gtg gtg get cct gac gee ttc gac att gtg gee atg 
Leu Asn Pro Ala Val Val Ala Pro Asp Ala Phe Asp lie Val Ala Met 
1180 1185 1190 1195 



3596 



gca get ggt gga gee ctg get gee ccc cag cgc cat gcc ctg ggg get 
Ala Ala Gly Gly Ala Leu Ala Ala Pro Gin Arg His Ala Leu Gly Ala 
1200 1205 1210 



3644 



gtg get cag etc eta cag cac get gcg get ggc aag gcc ttc tct ggg 
Val Ala Gin Leu Leu Gin His Ala Ala Ala Gly Lys Ala Phe Ser Gly 
1215 1220 1225 



3692 



cag age cag cac eta egg gtc ctg aat gac tat ctg gag gaa aca cac 
Gin Ser Gin His Leu Arg Val Leu Asn Asp Tyr Leu Glu Glu Thr His 
1230 1235 1240 



3740 



etc aag ttc agg aag ttc ate cat aga gcc tgc cag gtg cca gag cca 
Leu Lys Phe Arg Lys Phe He His Arg Ala Cys Gin Val Pro Glu Pro 
1245 1250 1255 



3788 



gag gag cgt ttt gca gtg gac gag tac tea gac atg gtg get gtg gcc 
Glu Glu Arg Phe Ala Val Asp Glu Tyr Ser Asp Met Val Ala Val Ala 
1260 1265 1270 1275 



3836 



aaa ccc atg gtg tac ate ace gtg ggg gag ctg gtc aac acg cac agg 
Lys Pro Met Val Tyr He Thr Val Gly Glu Leu Val Asn Thr His Arg 
1280 1285 1290 



3884 



ctg ttg ctg gag cac cag gac tgc att gcc cct gat cac caa gac ccc 
Leu Leu Leu Glu His Gin Asp Cys He Ala Pro Asp His Gin Asp Pro 
1295 1300 1305 



3932 



ctg cat gag etc ctg gag gat ctt ggg gag ctg ccc acc ate cct gac 
Leu His Glu Leu Leu Glu Asp Leu Gly Glu Leu Pro Thr lie Pro Asp 
1310 1315 1320 



3980 



ctt att ggt gag age ate get gca gat ggg cac aca gac ctg age aag 



4028 
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Leu He Gly Glu Ser He Ala Ala Asp Gly His Thr Asp Leu Ser Lys 
1325 1330 1335 
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eta gaa gtg tec ctg acg ctg acc aac aag ttt gaa gga eta gag gca 
Leu Glu Val Ser Leu Thr Leu Thr Asn Lys Phe Glu Gly Leu Glu Ala 
1340 1345 1350 1355 



4076 



gat get gat gac tec aac acc cgt ago ctg ctt ctg age acc aag cag 
Asp Ala Asp Asp Ser Asn Thr Arg Ser Leu Leu Leu Ser Thr Lys Gin 
1360 1365 1370 



4124 



ctg ttg gee gat ate ata cag ttc cat cct ggg gac acc etc aag gag 
Leu Leu Ala Asp He He Gin Phe His Pro Gly Asp Thr Leu Lys Glu 
1375 1380 1385 



4172 



ate ctg tee etc teg get tec aga gag caa gaa gca gee eac aag cag 
He Leu Ser Leu Ser Ala Ser Arg Glu Gin Glu Ala Ala His Lys Gin 
1390 1395 1400 



4220 



ctg atg age ega cge cag gee tgt aca gee cag aca eeg gag cca ctg 
Leu Met Ser Arg Arg Gin Ala Cys Thr Ala Gin Thr Pro Glu Pro Leu 
1405 1410 1415 



4268 



ega ega cac cge tea ctg aca get eac tee etc ctg cca ctg gca gag 
Arg Arg His Arg Ser Leu Thr Ala His Ser Leu Leu Pro Leu Ala Glu 
1420 1425 1430 1435 



4316 



aag cag egg cge gtc ctg egg aac ctg cge ega ctt gaa gee ctg ggg 
Lys Gin Arg Arg Val Leu Arg Asn Leu Arg Arg Leu Glu Ala Leu Gly 
1440 1445 1450 



4364 



ttg gtc age gee aga aat ggc tac cag ggg eta gtg gac gag ctg gee 
Leu Val Ser Ala Arg Asn Gly Tyr Gin Gly Leu Val Asp Glu Leu Ala 
1455 1460 1465 



4412 



aag gac ate cge aac cag cac aga cac agg eac agg egg aag gca gag 
Lys Asp He Arg Asn Gin His Arg His Arg His Arg Arg Lys Ala Glu 
1470 1475 1480 



4460 



ctg gtg aag ctg cag gcc aca tta cag ggc ctg age act aag acc acc 



4508 
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Leu Val I»ys Leu Gin Ala Thr Leu Gin Gly Leu Ser Thr Lys Thr Thr 
1485 1490 1495 

ttc tat gag gag cag ggt gac tac tac age cag tac ate egg gee tgc 4556 
Phe Tyr Glu Glu Gin Gly Asp Tyr Tyr Ser Gin Tyr He Arg Ala Cys 
1500 1505 1510 1515 

ctg gac cac ctg gcc ccc gac tec aag agt tct ggg aag ggg aag aag 4604 
Leu Asp His Leu Ala Pro Asp Ser Lys Ser Ser Gly Lys Gly Lys Lys 
1520 1525 1530 

cag ect tct ctt cat tac act get get cag etc ctg gaa aag ggt gtc 4652 
Gin Pro Ser Leu His Tyr Thr Ala Ala Gin Leu Leu Glu Lys Gly Val 
1535 1540 1545 

ttg gtg gaa att gaa gat ctt ccc gcc tct cac ttc aga aac gtc ate 4700 
Leu Val Glu He Glu Asp Leu Pro Ala Ser His Phe Arg Asn Val He 
1550 1555 1560 

ttt gac ate aeg ccg gga gat gag gca gga aag ttt gaa gta aat gcc 4748 
Phe Asp He Thr Pro Gly Asp Glu Ala Gly Lys Phe Glu Val Asn Ala 
1565 1570 1575 

aag ttc ctg ggt gtg gac atg gag cga ttt cag ctt cac tat cag gat 4796 
Lys Phe Leu Gly Val Asp Met Glu Arg Phe Gin Leu His Tyr Gin Asp 
1580 1585 1590 1595 

etc ctg cag etc cag tat gag ggt gtg get gtc atg aaa etc ttc aac 4844 
Leu Leu Gin Leu Gin Tyr Glu Gly Val Ala Val Met Lys Leu Phe Asn 
1600 1605 1610 

aag gee aaa gte aat gtc aac ctt etc ate ttc etc etc aac aag aag 4892 
Lys Ala Lys Val Asn Val Asn Leu Leu He Phe Leu Leu Asn Lys Lys 
1615 1620 1625 

ttt ttg egg aag tgacag aggcaaaggg tgctacccaa gcccctctta 4940 

Phe Leu Arg Lys 
1630 

cctctctgga tgctttcttt aacactaact caccactgtg cttccctgca gacacccaga 5000 
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gctcaggact gggcaaggcc cagggattct caccccttcc ccagctggga ggagcttgcc 5060 

tgcctggcca cagacagtgt atcttctaat tggctaaagt gggccttgcc cagagtccag 5120 

ctgtgtggct tttatcatgc atgacaaacc cctggctttc ctgccagatg gattctcatc 5180 

ccttacagct gactcttcca ggcaatttcc atagatctgc agtcctgcct ctgccacagt 5240 

ctctctgttg tccccacatc tacccaactt cctgtactgt tgcccttctg atgttaataa 5300 

aagcagctgt tactccc 5317 



<210> 4 

<211> 1631 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Glu Arg Arg Ala Ala Gly Pro Gly Trp Ala Ala Tyr Glu Arg Leu 
15 10 15 

Thr Ala Glu Glu Met Asp Glu Gin Arg Arg Gin Asn Val Ala Tyr Gin 
20 25 30 

Tyr Leu Cys Arg Leu Glu Glu Ala Lys Arg Trp Met Glu Ala Cys Leu 
35 40 45 

Lys Glu Glu Leu Pro Ser Pro Val Glu Leu Glu Glu Ser Leu Arg Asn 
50 55 60 

Gly Val Leu Leu Ala Lys Leu Gly His Cys Phe Ala Pro Ser Val Val 
65 70 75 80 

Pro Leu Lys Lys lie Tyr Asp Val Glu Gin Leu Arg Tyr Gin Ala Thr 
85 90 95 

Gly Leu His Phe Arg His Thr Asp Asn lie Asn Phe Trp Leu Ser Ala 
100 105 110 
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He Ala His He Gly Leu Pro Ser Thr Phe Phe Pro Glu Thr Thr Asp 
115 120 125 

He Tyr Asp Lys Lys Asn Met Pro Arg Val Val Tyr Cys He His Ala 
130 135 140 

Leu Ser Leu Phe Leu Phe Arg Leu Gly Leu Ala Pro Gin He His Asp 
145 150 155 160 

Leu Tyr Gly Lys Val Lys Phe Thr Ala Glu Glu Leu Ser Asn Met Ala 
165 170 175 

Ser Glu Leu Ala Lys Tyr Gly Leu Gin Leu Pro Ala Phe Ser Lys He 
180 185 190 

Gly Gly He Leu Ala Asn Glu Leu Ser Val Asp Glu Ala Ala Val His 
195 200 205 

jy.a Ala Val Leu Ala He Asn Glu Ala Val Glu Arg Gly Val Val Glu 
210 215 220 



Asp Thr Leu Ala Ala Leu Gin Asn 
225 230 

Arg Glu Pro Leu Ala Ala Val Tyr 
245 

Met Glu Lys Ala Ala Asn Ala Arg 
260 

Asp He Tyr Asp His Tyr Leu Thr 
275 280 

Asn His Val Asn Val His Gly Ala 
290 295 

Glu Arg Gin Ser Pro Glu Ala Leu 
305 310 



Pro Ser Ala Leu Leu Glu Asn Leu 
235 240 

Gin Glu Met Leu Ala Gin Ala Lys 
250 255 ' 

Asn His Asp Asp Arg Glu Ser Gin 
265 270 

Gin Ala Glu He Gin Gly Asn He 
285 

Leu Glu Val Val Asp Asp Ala Leu 
300 

Leu Lys Ala Leu Gin Asp Pro Ala 
315 320 
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Leu Ala Leu Arg Gly Val Arg Arg T^p Phe Ala Asp Trp Tyr Leu Glu 
325 330 335 

Gin Leu Asn Ser Asp Arg Glu Gin Lys Ala Gin Glu Leu Gly Leu Val 
340 345 350 

Glu Leu Leu Glu Lys Glu Glu Val Gin Ala Gly Val Ala Ala Ala Asn 
355 360 365 

Thr Lys Gly Asp Gin Glu Gin Ala Met Leu His Ala Val Gin Arg lie 
370 375 380 

Asn Lys Ala He Arg Arg Gly Val Ala Ala Asp Thr Val Lys Glu Leu 
385 390 395 400 

Met Cys Pro Glu Ala Gin Leu Pro Pro Val Tyr Pro Val Ala Ser Ser 
405 410 415 

Met Tyr Gin Leu Glu Leu Ala Val Leu Gin Gin Gin Gin Gly Glu Leu 
420 425 430 

Gly Gin Glu Glu Leu Phe Val Ala Val Glu Met Leu Ser Ala Val Val 
435 440 . 445 

Leu He Asn Arg Ala Leu Glu Ala Arg Asp Ala Ser Gly Phe Trp Ser 
450 455 460 

Ser Leu Val Asn Pro Ala Thr Gly Leu Ala Glu Val Glu Gly Glu Asn 
465 470 475 480 

Ala Gin Arg Tyr Phe Asp Ala Leu Leu Lys Leu Arg Gin Glu Arg Gly 
485 490 495 

Met Gly Glu Asp Phe Leu Ser Trp Asn Asp Leu Gin Ala Thr Val Ser 
500 505 510 

Gin Val Asn Ala Gin Thr Gin Glu Glu Thr Asp Arg Val Leu Ala Val 
515 520 525 

Ser Leu lie Asn Glu Ala Leu Asp Lys Gly Ser Pro Glu Lys Thr Leu 
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530 535 540 

Ser Ala Leu Leu Leu Pro Ala Ala Gly Leu Asp Asp Val Ser Leu Pro 
545 550 555 560 

Val Ala Pro Arg Tyr His Leu Leu Leu Val Ala Ala Lys Arg Gin Lys 
565 570 575 

Ala Gin Val Thr Gly Asp Pro Gly Ala Val Leu Trp Leu Glu Glu lie 
580 585 590 

Arg Gin Gly Val Val Arg Ala Asn Gin Asp Thr Asn Thr Ala Gin Arg 
595 600 605 

Met Ala Leu Gly Val Ala Ala lie Asn Gin TQa lie Lys Glu Gly Lys 
610 615 620 

Ala Ala Gin Thr Glu Arg Val Leu Arg Asn Pro Ala Val Ala Leu Arg 
625 630 635 640 

Gly Val Val Pro T^p Cys Ala Asn Gly Tyr Gin Arg Ala Leu Glu Ser 
645 650 655 

Ala Met Ala Lys Lys Gin Arg Pro Ala Asp Thr Ala Phe Trp Val Gin 
660 665 670 

His Asp Met Lys Asp Gly Thr Ala Tyr Tyr Phe His Leu Gin Thr Phe 
675 680 685 

Gin Gly lie Trp Glu Gin Pro Pro Gly Cys Pro Leu Asn Thr Ser His 
690 695 700 

Leu Thr Arg Glu Glu He Gin Ser Ala Val Thr Lys Val Thr Ala Ala 
705 710 715 720 

Tyr Asp Arg Gin Gin Leu Trp Lys Ala Asn Val Gly Phe Val He Gin 
725 730 735 

Leu Gin Ala Arg Leu Arg Gly Phe Leu Val Arg Gin Lys Phe Ala Glu 
740 745 750 
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His Ser His Phe Leu Arg Thr Trp Leu Pro Ala Val lie Lys He Gin 
755 760 765 

Ala His Trp Arg Gly Tyr Arg Gin Arg Lys He Tyr Leu Glu Trp Leu 
770 775 780 

Gin Tyr Phe Lys Ala Asn Leu Asp Ala He He Lys He Gin Ala Trp 
785 790 795 800 

Ala Arg Met Trp Ala Ala Arg Arg Gin Tyr Leu Arg Arg Leu His Tyr 
805 810 815 

Phe Gin Lys Asn Val Asn Ser He Val Lys He Gin Ala Phe Phe Arg 
820 825 830 

Ala Arg Lys Ala Gin Asp Asp Tyr Arg He Leu Val His Ala Pro His 
835 840 845 

Pro Pro Leu Ser Val Val Arg Arg Phe Ala His Leu Leu Asn Gin Ser 
850 855 860 

Gin Gin Asp Phe Leu Ala Glu Ala Glu Leu Leu Lys Leu Gin Glu Glu 
865 870 875 880 

Val Val Arg Lys He Arg Ser Asn Gin Gin Leu Glu Gin Asp Leu Asn 
885 890 895 

He Met Asp He Lys He Gly Leu Leu Val Lys Asn Arg He Thr Leu 
900 905 910 

Gin Glu Val Val Ser His Cys Lys Lys Leu Thr Lys Arg Asn Lys Glu 
915 920 925 

Gin Leu Ser Asp Met Met Val Leu Asp Lys Gin Lys Gly Leu Lys Ser 
930 935 940 

Leu Ser Lys Glu Lys Arg Gin Lys Leu Glu Ala Tyr Gin His Leu Phe 
945 950 955 960 
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Tyr Leu Leu Gin Thr Gin Pro He Tyr Leu Ala Lys Leu He Phe Gin 
965 970 975 

Met Pro Gin Asn Lys Thr Thr Lys Phe Met Glu Ala Val He Phe Ser 
980 985 990 

Leu Tyr Asn Tyr Ala Ser Ser Arg Arg Glu Ala Tyr Leu Leu Leu Gin 
995 1000 1005 

Leu Phe Lys Thr Ala Leu Gin Glu Glu He Lys Ser Lys Val Glu Gin 
1010 1015 1020 

Pro Gin Asp Val Val Thr Gly Asn Pro Thr Val Val Arg Leu Val Val 
1025 1030 1035 1040 

Arg Phe Tyr Arg Asn Gly Arg Gly Gin Ser Ala Leu Gin Glu He Leu 
1045 1050 1055 

Gly Lys Val He Gin Asp Val Leu Glu Asp Lys Val Leu Ser Val His 
1060 1065 1070 

Thr Asp Pro Val His Leu Tyr Lys Asn Trp He Asn Gin Thr Glu Ala 
1075 1080 1085 

Gin Thr Gly Gin Arg Ser His Leu Pro Tyr Asp Val Thr Pro Glu Gin 
1090 1095 1100 

Ala Leu Ser His Pro Glu Val Gin Arg Arg Leu Asp He Ala Leu Arg 
1105 1110 1115 1120 

Asn Leu Leu Ala Met Thr Asp Lys Phe Leu Leu Ala He Thr Ser Ser 
1125 1130 1135 

Val Asp Gin He Pro Tyr Gly Met Arg Tyr Val Ala Lys Val Leu Lys 
1140 1145 1150 

Ala Thr Leu Ala Glu Lys Phe Pro Asp Ala Thr Asp Ser Glu Val Tyr 
1155 1160 1165 

Lys Val Val Gly Asn Leu Leu Tyr Tyr Arg Phe Leu Asn Pro Ala Val 
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1170 1175 1180 

Val Ala Pro Asp Ala Phe Asp lie Val Ala Met Ala Ala 61y Gly Ala 
1185 1190 1195 1200 

Leu Ala Ala Pro Gin Arg His Ala Leu Gly Ala Val Ala Gin Leu Leu 
1205 1210 1215 

Gin His Ala Ala Ala Gly Lys Ala Phe Ser Gly Gin Ser Gin His Leu 
1220 1225 1230 

Arg Val Leu Asn Asp Tyr Leu Glu Glu Thr His Leu Lys Phe Arg Lys 
1235 1240 1245 

Phe lie His Arg Ala Cys Gin Val Pro Glu Pro Glu Glu Arg Phe Ala 
1250 1255 1260 

Val Asp Glu Tyr Ser Asp Met Val Ala Val Ala Lys Pro Met Val Tyr 
1265 1270 1275 1280 

lie Thr Val Gly Glu Leu Val Asn Thr His Arg Leu Leu Leu Glu His 
1285 1290 1295 

Gin Asp Cys lie Ala Pro Asp His Gin Asp Pro Leu His Glu Leu Leu 
1300 1305 1310 

Glu Asp Leu Gly Glu Leu Pro Thr lie Pro Asp Leu lie Gly Glu Ser 
1315 1320 1325 

lie Ala Ala Asp Gly His Thr Asp Leu Ser Lys Leu Glu Val Ser Leu 
1330 1335 1340 

Thr Leu Thr Asn Lys Phe Glu Gly Leu Glu Ala Asp Ala Asp Asp Ser 
1345 1350 1355 1360 

Asn Thr Arg Ser Leu Leu Leu Ser Thr Lys Gin Leu Leu Ala Asp lie 
1365 1370 1375 

lie Gin Phe His Pro Gly Asp Thr Leu Lys Glu lie Leu Ser Leu Ser 
1380 1385 1390 
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Ala Ser Arg Glu Gin Glu Ala Ala His Lys Gin Leu Met Ser Arg Arg 
1395 1400 1405 

Gin Ala Cys Thr Ala Gin Thr Pro Glu Pro Leu Arg Arg His Arg Ser 
1410 1415 1420 

Leu Thr Ala His Ser Leu Leu Pro Leu Ala Glu Lys Gin Arg Arg Val 
1425 1430 1435 1440 

Leu Arg Asn Leu Arg Arg Leu Glu Ala Leu Gly Leu Val Ser Ala TVrg 
1445 1450 1455 

Asn Gly Tyr Gin Gly Leu Val Asp Glu Leu Ala Lys Asp He Arg Asn 
1460 1465 1470 

Gin His Arg His Arg His Arg Arg Lys Ala Glu Leu Val Lys Leu Gin 
1475 1480 1485 

Ala Thr Leu Gin Gly Leu Ser Thr Lys Thr Thr Phe Tyr Glu Glu Gin 
1490 1495 1500 

Gly Asp Tyr Tyr Ser Gin Tyr He Arg Ala Cys Leu Asp His Leu Ala 
1505 1510 1515 1520 

Pro Asp Ser Lys Ser Ser Gly Lys Gly Lys Lys Gin Pro Ser Leu His 
1525 1530 1535 

Tyr Thr Ala Ala Gin Leu Leu Glu Lys Gly Val Leu Val Glu He Glu 
1540 1545 1550 

Asp Leu Pro Ala Ser His Phe Arg Asn Val He Phe Asp He Thr Pro 
1555 1560 1565 

Gly Asp Glu Ala Gly Lys Phe Glu Val Asn Ala Lys Phe Leu Gly Val 
1570 1575 1580 

Asp Met Glu Arg Phe Gin Leu His Tyr Gin Asp Leu Leu Gin Leu Gin 
1585 1590 1595 1600 
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Tyr Glu 



Gly Val Ala Val Met Lys Leu Phe Asn Lys Ala Lys Val Asn 
1605 1610 1615 



Val Asn 



Leu Leu lie Phe Leu Leu Asn Lys Lys Phe Leu Arg Lys 
1620 1625 1630 



<210> 5 

<211> 3608 

<212> DHA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (424) (1908) 

<223> Clone LBFLllO 

<400> 5 

ggcggcgaga gcagctgcag ctcgcatctc aggcagtacc tagaggagct gccggtgcct 60 

cctcagaaca tctcctgatc gctacccagg accaggcacc aaggacaggg agtcccaggc 120 

gcacaccccc cattctgggt cccccaggcc cagaccccca ctctgccaca ggttgcatct 180 

tgacctggtc ctcctgcaga agtggcccct gtggtcctgc tctgagactc gtccctgggc 240 

gcccctgcag cccctttcta tgactccatc tggatttggc tggctgtggg gacgcggtcc 300 

gaggggcggc ctggctctca gcgtggtggc agccagctct ctggccacca tggcaaatgc 360 

tgagatctga ggggacaagg ctctacagcc tcagccaggg gcactcagct gttgcagggt 420 

gtg atg gag aac aaa get atg tac eta cac acc gtc age gac tgt 465 



Met Glu Asn Lys Ala Met Tyr Leu His Thr Val 
15 10 



Ser Asp Cys 



gac acc age tec ate tgt gag gat tec ttt gat ggc agg age ctg tec 



513 
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Asp Thr Ser Ser He Cys Glu Asp Ser Phe Asp Gly Arg Ser Leu Ser 
15 20 25 30 

aag ctg aac ctg tgt gag gat ggt cca tgt cac aaa egg egg gca age 561 
Lys Leu Asn Leu Cys Glu Asp Gly Pro Cys His Lys Arg Arg Ala Ser 
35 40 45 

ate tgc tgt acc cag ctg ggg tec ctg teg gee ctg aag cat get gtc 609 
He Cys Cys Thr Gin Leu Gly Ser Leu Ser 7U.a Leu Lys His Ala Val 
50 55 60 

etg ggg etc tac ctg ctg gte ttc ctg att ctt gtg ggc ate tte ate 657 
Leu Gly Leu Tyr Leu Leu Val Phe Leu He Leu Val Gly He Phe He 
65 70 75 

tta gea gtg tee agg ccg cge age tee eet gac gae ctg aag gee etg 705 
Leu Ala Val Ser Arg Pro Arg Ser Ser Pro Asp Asp Leu Lys Ala Leu 
80 85 90 

act cgc aat gtg aac egg ctg aat gag age ttc egg gac ttg cag ctg 753 
Thr Arg Asn Val Asn Arg Leu Asn Glu Ser Phe Arg Asp Leu Gin Leu 
95 100 105 110 

egg ctg ctg cag get ccg ctg caa gcg gac etg acg gag cag gtg tgg 801 
Arg Leu Leu Gin Ala Pro Leu Gin Ala Asp Leu Thr Glu Gin Val Trp 
115 120 125 

aag gtg cag gac gcg ctg cag aac cag tea gac teg ttg ctg gcg ctg 849 
Lys Val Gin Asp Ala Leu Gin Asn Gin Ser Asp Ser Leu Leu Ala Leu 
130 135 140 

gcg ggc gca gtg cag egg ctg gag ggc gcg ctg tgg ggg ctg cag gcg 897 
Ala Gly Ala Val Gin Arg Leu Glu Gly Ala Leu Trp Gly Leu Gin Ala 
145 150 155 

cag gcg gtg cag ace gag cag gcg gtg gee ctg ctg egg gae cgc acg 945 
Gin Ala Val Gin Thr Glu Gin Ala Val Ala Leu Leu Arg Asp Arg Thr 
160 165 170 

ggc cag cag age gac acg gcg cag ctg gag etc tac cag etg cag gtg 993 
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Gly Gin Gin Ser Asp Thr Ala Gin Leu Glu Leu Tyr Gin Leu Gin Val 
175 180 185 190 



PCT/KR2003/002161 



gag age aac agt age cag ctg ctg ctg agg cgc cac gcg ggc ctg ctg 
Glu Ser Asn Ser Ser Gin Leu Leu Leu Arg Arg His Ala Gly Leu Leu 
195 200 205 



1041 



gac ggg ctg gcg cgc agg gtg ggc ate ctg ggc gag gag ctg gcc gac 
Asp Gly Leu Ala TVrg Arg Val Gly lie Leu Gly Glu Glu Leu Ala Asp 
210 215 220 



1089 



gtg ggc ggc gtg ctg cgc ggc etc aac cac age ctg tec tac gac gtg 
Val Gly Gly Val Leu Arg Gly Leu Asn His Ser Leu Ser Tyr Asp Val 
225 230 235 



1137 



gcc etc cac cgc aeg egg ctg cag gac ctg egg gtg ctg gtg age aac 
Ala Leu His Arg Thr Arg Leu Gin Asp Leu Arg Val Leu Val Ser Asn 
240 245 250 



1185 



gcc age gag gac aeg cgc cgc ctg cgc ctg gcg cac gta ggc atg gag 
Ala Ser Glu Asp Thr Arg Arg Leu Arg Leu Ala His Val Gly Met Glu 
255 260 265 270 



1233 



ctg cag ctg aag cag gag ctg gee atg etc aac gcg gtc ace gag gac 
Leu Gin Leu Lys Gin Glu Leu Ala Met Leu Asn Ala Val Thr Glu Asp 
275 280 285 



1281 



ctg cgc etc aag gac tgg gag cac tee ate gea ctg egg aac ate tee 
Leu Arg Leu Lys Asp Trp Glu His Ser lie Ala Leu Arg Asn He Ser 
290 295 300 



1329 



etc gcg aaa ggg eea ccg gga eec aaa ggt gat cag ggg cat gaa gga 
Leu Ala Lys Gly Pro Pro Gly Pro Lys Gly Asp Gin Gly His Glu Gly 
305 310 315 



1377 



aag gaa ggc agg cet ggc ate cct gga ttg cct gga ctt cga ggt ctg 
Lys Glu Gly Arg Pro Gly He Pro Gly Leu Pro Gly Leu Arg Gly Leu 
320 325 330 



1425 



ccc ggg gag aga ggt acc cca gga ttg ccc ggg ccc aag ggc gat gat 



1473 
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Pro Gly Glu Arg Gly Thr Pro Gly Leu Pro Gly Pro Lys Gly Asp Asp 
335 340 345 350 

ggg aag ctg ggg gcc aca gga cca atg ggc atg cgt ggg ttc aaa ggt 1521 
Gly Lys Leu Gly Ala Thr Gly Pro Met Gly Met Arg Gly Phe Lys Gly 
355 360 365 

gac cga ggc cca aaa gga gag aaa gga gag aaa gga gac aga get ggg 1569 
Asp Arg Gly Pro Lys Gly Glu Lys Gly Glu Lys Gly Asp Arg Ala Gly 
370 375 380 

gat gcc agt ggc gtg gag gcc ccg atg atg ate cgc ctg gtg aat ggc 1617 
Asp Ala Ser Gly Val Glu Ala Pro Met Met He Arg Leu Val Asn Gly 
385 390 395 

tea ggt ccg cac gag ggc cgc gtg gaa gtg tac cac gac egg cgt tgg 1665 
Ser Gly Pro His Glu Gly Arg Val Glu Val Tyr His Asp Arg Arg Trp 
400 405 410 

ggc ace gtg tgt gac gac ggc tgg gac aag aag gac gga gac gtg gtg 1713 
Gly Thr Val Cys Asp Asp Gly Trp Asp Lys Lys Asp Gly Asp Val Val 
415 420 425 430 

tgc cgc atg etc ggc ttc ege ggt gtg gag gag gtg tac cgc aca get 1761 
Cys Arg Met Leu Gly Phe Arg Gly Val Glu Glu Val Tyr Arg Thr Ala 
435 440 445 

cga ttc ggg caa ggc act ggg agg ate tgg atg gat gac gtt gcc tgc 1809 
Arg Phe Gly Gin Gly Thr Gly Arg He Trp Met Asp Asp Val Ala Cys 
450 455 460 

aag ggc aca gag gaa ace ate ttc cgc tgc age ttc tec aaa tgg ggg 1857 
Lys Gly Thr Glu Glu Thr He Phe Arg Cys Ser Phe Ser Lys Trp Gly 
4 65 470 475 

gtg aca aac tgt gga cat gcc gaa gat gcc age gtg aca tgc aac aga 1905 
Val Thr Asn Cys Gly His Ala Glu Asp Ala Ser Val Thr Cys Asn Arg 
480 485 490 

cac tg aaagtgggca gagcccaagt tcggggtcct gcacagagea cccttcctgc 1960 
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His 
495 



atccctgggg tggggcacag ctcggggcca ccctgaccat gcctcgacca caccccgtcc 2020 

agcattctca gtcctcacac ctgcatccca ggaccgtggg ggccggtcgt catttccctc 2080 

ttgaacatgt gctccgaagt ataactctgg gacctactgc ccgtctctct cttccaccag 2140 

gttcctgcat gaggagccct gatcaactgg atcaccactt tgcccagcct ctgaacacca 2200 

tgcaccaggc ctcaatatcc cagttccctt tggcctttta gttacaggtg aatgctgaga 2260 

atgtgtcaga gacaagtgca gcagcagcga tggttggtag tatagatcat ttactcttca 2320 

gacaattccc aaacctccat tagtccaaga gtttctacat cttcctcccc agcaagaggc 2380 

aacgtcaagt gatgaatttc ccccctttac tctgcctctg ctccccattt gctagtttga 2440 

ggaagtgaca tagaggagaa gccagctgta ggggcaagag ggaaatgcaa gtcacctgca 2500 

ggaatccagc tagatttgga gaagggaatg aaactaacat tgaatgacta ccatggcacg 2560 

ctaaatagta tcttgggtgc caaattcatg tatccactta gctgcattgg tccagggcat 2620 

gtcagtctgg atacagcctt acctccaggt agcacttaac tggtccattc acctagactg 2680 

caagtaagaa gacaaaatga ctgagaccgt gtgcccacct gaacttattg tctttacttg 2740 

gcctgagcta aaagcttggg tgcaggacct gtgtaactag aaagttgcct acttcagaac 2800 

ctccagggcg tgagtgcaag gtcaaacatg actggcttcc aggccgacca tcaatgtagg 2860 

aggagagctg atgtggaggg tgacatgggg gctgcccatg ttaaacctga gtccagtgct 2920 

ctggcattgg gcagtcacgg ttaaagccaa gtcatgtgtg tctcagctgt ttggaggtga 2980 

bgattttgca tcttccaagc ctcttcaggt gtgaatctgt ggtcaggaaa acacaagtcc 3040 

taatggaacc cttagggggg aaggaaatga agattcccta taacctctgg gggtggggag 3100 
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taggaataag gggcttgggc ctccataaat ctgcaatctg caccctcctc ctagagacag 3160 

ggagatcgtg ttctgctttt tacatgagga gcagaactgg gccatacaca tgttcaagaa 3220 

ctaggggagc tacctggtag caagtgagtg cagacccacc tcaccttggg ggaatctcaa 3280 

actcataggc ctcagataca cgatcacctg tcatatcagg tgagcactgg cctgcttggg 3340 

gagagacctg ggcccctcca ggtgtaggaa cagcaacact cctggctgac aactaagcca 3400 

atatggccct aggtcattct tgcttccaat atgcttgcca ctccttaaat gtcctaatga 3460 

tgagaaactc tctttctgac caattgctat gtttacataa cacgcatgta ctcatgcatc 3520 

ccttgccaga gcccatatat gtatgcatat ataaacatag cactttttac tacatagctc 3580 

agcacattgc aaggtttgca tttaagtt 3608 



<210> 6 

<211> 495 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Glu Asn Lys Ala Met Tyr Leu His Thr Val Ser Asp Cys Asp Thr 
15 10 15 

Ser Ser He Cys Glu Asp Ser Phe Asp Gly Arg Ser Leu Ser Lys Leu 
20 25 30 

Asn Leu Cys Glu Asp Gly Pro Cys His Lys Arg Arg Ala Ser He Cys 
35 40 45 

Cys Thr Gin Leu Gly Ser Leu Ser Ala Leu Lys His Ala Val Leu Gly 
50 55 60 

Leu Tyr Leu Leu Val Phe Leu He Leu Val Gly He Phe He Leu Ala 
65 70 75 80 
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Val Ser Arg Pro Arg Ser Ser Pro Asp Asp Leu Lys Ala Leu Thr Arg 
85 90 95 

Asn Val Asn Arg Leu Asn Glu Ser Phe Arg Asp Leu Gin Leu Arg Leu 
100 105 110 

Leu Gin Ala Pro Leu Gin Ala Asp Leu Thr Glu Gin Val Trp Lys Val 
115 120 125 

Gin Asp Ala Leu Gin Asn Gin Ser Asp Ser Leu Leu Ala Leu Ala Gly 
130 135 140 

Ala Val Gin Arg Leu Glu Gly Ala Leu Trp Gly Leu Gin Ala Gin Ala 
145 150 155 160 

Val Gin Thr Glu Gin Ala Val Ala Leu Leu TVrg Asp Arg Thr Gly Gin 
165 170 175 

Gin Ser Asp Thr Ala Gin Leu Glu Leu Tyr Gin Leu Gin Val Glu Ser 
180 185 190 

Asn Ser Ser Gin Leu Leu Leu Arg Arg His Ala Gly Leu Leu Asp Gly 
195 200 205 

Leu Ala Arg Arg Val Gly lie Leu Gly Glu Glu Leu Ala Asp Val Gly 
210 215 220 

Gly Val Leu Arg Gly Leu Asn His Ser Leu Ser Tyr Asp Val Ala Leu 
225 230 235 240 

His Arg Thr Arg Leu Gin Asp Leu Arg Val Leu Val Ser Asn Ala Ser 
245 250 255 

Glu Asp Thr Arg Arg Leu Arg Leu Ala His Val Gly Met Glu Leu Gin 
260 265 270 

Leu Lys Gin Glu Leu Ala Met Leu Asn Ala Val Thr Glu Asp Leu Arg 
275 280 285 
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Leu Lys Asp Trp Glu His Ser He Ala Leu Arg Asn He Ser Leu Ala 
290 295 300 

Lys Gly Pro Pro Gly Pro Lys Gly Asp Gin Gly His Glu Gly Lys Glu 
305 310 315 320 

Gly Arg Pro Gly He Pro Gly Leu Pro Gly Leu Arg Gly Leu Pro Gly 
325 330 335 

Glu Arg Gly Thr Pro Gly Leu Pro Gly Pro Lys Gly Asp Asp Gly Lys 
340 345 350 

Leu Gly Ala Thr Gly Pro Met Gly Met Arg Gly Phe Lys Gly Asp Arg 
355 360 365 

Gly Pro Lys Gly Glu Lys Gly Glu Lys Gly Asp Arg Ala Gly Asp Ala 
370 375 380 

Ser Gly Val Glu Ala Pro Met Met He Arg Leu Val Asn Gly Ser Gly 
385 390 395 400 

Pro His Glu Gly Arg Val Glu Val Tyr His Asp Arg Arg Trp Gly Thr 
405 410 415 

Val Cys Asp Asp Gly Trp Asp Lys Lys Asp Gly Asp Val Val Cys Arg 
420 425 430 

Met Leu Gly Phe Arg Gly Val Glu Glu Val Tyr Arg Thr Ala Arg Phe 
435 440 445 

Gly Gin Gly Thr Gly Arg He Trp Met Asp Asp Val Ala Cys Lys Gly 
450 455 460 

Thr Glu Glu Thr He Phe Arg Cys Ser Phe Ser Lys Trp Gly Val Thr 
465 470 475 480 



Asn Cys Gly His TVla Glu Asp Ala Ser Val Thr Cys Asn Arg His 
485 490 495 
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<210> 7 

<2H> 3X62 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (405) . . (1835) 

<223> Clone LBFL123 



<400> 7 

tggccggggt gccccgcaaa gagagacaag ggagaaaaag acagcaggga aaaccccggg 60 

gaggagaagg cgaaagagaa ggtggagctc agagaagggg gccggctccc cagctccatg 120 

tggccgccgc cgctgcgggt ctgtgggggc agagggcggc ggctcccagg gcagcgcgta 180 

gcgggaccga ttgcctaata ctccggcagg ggccggggcc gcagctggct cggataaata 240 

gccgcccggc tggcccggag ctgcagggga gagcggcggc cgcgatcccc accacaccac 300 

cagcccggcc gcacggggca ctgagccggg tgctgagcac cggaggcccc gccgaggccg 360 

ggactcagga cctgcagaga aacgcctcct gattttgtct taca atg gaa ctt 413 

Met Glu Leu 
1 

aaa aag teg cot gac ggt gga tgg ggc tgg gtg att gtg ttt gtc tec 461 
Lys Ztys Ser Pro Asp Gly Gly Trp Gly Trp Val lie Val Phe Val Ser 
5 10 15 

ttc ctt act cag ttt ttg tgt tac gga tec cca eta get gtt gga gtc 509 
Phe Leu Thr Gin Phe Leu Cys Tyr Gly Ser Pro Leu Ala Val Gly Val 
20 25 30 35 

etg tac ata gaa tgg etg gat gee ttt ggt gaa gga aaa gga aaa aca 557 
Leu Tyr lie Glu Trp Leu Asp Ala Phe Gly Glu Gly Lys Gly Lys Thr 
40 45 50 
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gcc tgg gtt gga tec ctg gca agt gga gtt ggc ttg ctt gca agt cct 
Ala Trp Val Gly Ser Leu Ala Ser Gly Val Gly Leu Leu Ala Ser Pro 
55 60 65 



605 



gtc tgc agt etc tgt gtc tea tct ttt gga gca aga cct gtc aca ate 
Val Cys Ser Leu Cys Val Ser Ser Phe Gly Ala Arg Pro Val Thr lie 
70 75 80 



653 



ttc agt ggc ttc atg gtg get gga ggc ctg atg ttg age agt ttt get 
Phe Ser Gly Phe Met Val Ala Gly Gly Leu Met Leu Ser Ser Phe Ala 
85 90 95 



701 



ccc aat ate tac ttt ctg ttt ttt tec tat ggc att gtt gta ggt tea 
Pro Asn He Tyr Phe Leu Phe Phe Ser Tyr Gly He Val Val Gly Ser 
100 105 110 115 



749 



age gtt ggc ctt ttc ata tat get get ctg cag agg atg ctg gtt gag 
Ser Val Gly Leu Phe He Tyr Ala Ala Leu Gin Arg Met Leu Val Glu 
120 125 130 



797 



ttc tat gga ctg gat gga tgc ttg ctg att gtg ggt get tta get tta 
Phe Tyr Gly Leu Asp Gly Cys Leu Leu He Val Gly Ala Leu Ala Leu 
135 , 140 145 



845 



aat ata tta gcc tgt ggc agt ctg atg aga ccc etc caa tct tct gat 
Asn He Leu Ala Cys Gly Ser Leu Met Arg Pro Leu Gin Ser Ser Asp 
150 155 160 



893 



tgt cct ttg cct aaa aaa ata get cea gaa gat eta cea gat aaa tac 
Cys Pro Leu Pro Lys Lys He Ala Pro Glu Asp Leu Pro Asp Lys Tyr 
165 170 175 



941 



tec att tac aat gaa aaa gga aag aat ctg gaa gaa aac ata aac att 
Ser He Tyr Asn Glu Lys Gly Lys Asn Leu Glu Glu Asn He Asn He 
180 185 190 195 



989 



ctt gac aag age tac agt agt gag gaa aaa tgc agg ate aeg tta gcc 
Leu Asp Lys Ser Tyr Ser Ser Glu Glu Lys Cys Arg He Thr Leu Ala 
200 205 210 



1037 
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aat ggt gac tgg aaa caa gac age eta ctt eat aaa aac ccc aea gtg 1085 
Asn Gly Asp Trp Lys Gin Asp Ser Leu Leu His Lys Asn Pro Thr Val 
215 220 225 

aea cac aea aaa gag cct gaa acg tac aaa aag aaa gtt gca gaa eag 1133 
Thr His Thr Lys Glu Pro Glu Thr Tyr Lys Lys Lys Val Ala Glu Gin 
230 235 240 

aca tat ttt tgc aaa cag ctt gee aag agg aag tgg cag tta tat aaa 1181 
Thr Tyr Phe Cys Lys Gin Leu Ala Lys Arg Lys Trp Gin Leu Tyr Lys 
245 250 255 

aac tac tgt ggt gaa act gtg get ctt ttt aaa aac aaa gta ttt tea 1229 
Asn Tyr Cys Gly Glu Thr Val Ala Leu Phe Lys Asn Lys Val Phe Ser 
260 265 270 275 

gee ctt ttc att get ate tta etc ttt gac ate gga ggg ttt cca cct 1277 
Ala Leu Phe lie Ala lie Leu Leu Phe Asp lie Gly Gly Phe Pro Pro 
280 285 290 

tea tta ctt atg gaa gat gta gca aga agt tea aac gtg aaa gaa gaa 1325 
Ser Leu Leu Met Glu Asp Val Ala Arg Ser Ser Asn Val Lys Glu Glu 
295 300 305 

gag ttt att atg cca ctt att tec att ata ggc att atg aca gca gtt 1373 
Glu Phe lie Met Pro Leu He Ser He He Gly He Met Thr Ala Val 
310 315 320 

ggt aaa ctg ctt tta ggg ata ctg get gac ttc aag tgg att aat ace 1421 
Gly Lys Leu Leu Leu Gly He Leu Ala Asp Phe Lys Trp He Asn Thr 
325 330 335 

ttg tat ctt tat gtt get ace tta ate ate atg ggc eta gee ttg tgt 1469 
Leu Tyr Leu Tyr Val Ala Thr Leu He He Met Gly Leu Ala Leu Cys 
340 345 350 355 

gca att cca ttt gee aaa age tat gtc aca ttg gcg ttg ctt tet ggg 1517 
Ala He Pro Phe Ala Lys Ser Tyr Val Thr Leu Ala Leu Leu Ser Gly 
360 365 370 
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ate eta ggg ttt ctt act ggt aat tgg tec ate ttt eca tat gtg acc 1565 
He Leu Gly Phe Leu Thr Gly Asn Trp Ser He Phe Pro Tyr Val Thr 
375 380 385 



acg aag act gtg gga att gaa aaa tta gee eat gee tat ggg at a tta 1613 
Thr Lys Thr Val Gly He Glu Lys Leu Ala His Ala Tyr Gly He Leu 
390 395 400 



atg ttc ttt get gga ett gga aat age eta gga cea eee ate gtt ggt 1661 
Met Phe Phe Ala Gly Leu Gly Asn Ser Leu Gly Pro Pro He Val Gly 
405 410 415 



tgg ttt tat gae tgg acc cag acc tat gat att gea ttt tat ttt agt 1709 
Trp Phe Tyr Asp Trp Thr Gin Thr Tyr Asp He Ala Phe Tyr Phe Ser 
420 425 430 435 



ggc ttc tge gte etg ctg gga ggt ttt att etg ctg ctg gea gee ttg 1757 

Gly Phe Cys Val Leu Leu Gly Gly Phe He Leu Leu Leu Ala Ala Leu 
440 445 450 

eee tct tgg gat aca tge aac aag caa etc eee aag eca get eca aca 1805 

Pro Ser Trp Asp Thr Cys Asn Lys Gin Leu Pro Lys Pro Ala Pro Thr 
455 460 465 



act ttc ttg tac aaa gtt gee tct aat gtt tagaa gaatattgga 1850 

Thr Phe Leu Tyr Lys Val Ala Ser Asn Val 
470 475 



agaeactatt tttgetattt tataceatat agcaacgata ttttaacaga teteaageaa 1910 

attttetaga gtcaagaeta ttt tct cat a geaaaattte acaatgaetg actctgaatg 1970 

aattattttt ttttttttat atateetatt ttttatgtag tgtatgegta gectctatet 2030 

cgtatttttt tetatttetc etccccacac eateaatggg actattctgt tttgetgtta 2090 

ttcaetagtt cttaaeattg taaaaagttt gaecageetc agaaggettt ctctgtgtaa 2150 

agaagtataa tttctctgct gaetccattt aatecactgc aaggcaeeta gagagaetgc 2210 
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tcctatttta aaagtgatgc aagcatcatg ataagatatg tgtgaagccc actaggaaat 2270 

aaatcattct cttctctatg tttgacttgc tagtaaacag aagacttcaa gccagccagg 2330 

aaattaaagt ggcgactaaa acagccttaa gaattgcagt ggagcaaatt ggtcattttt 2390 

taaaaaaata tattttaacc tacagtcacc agttttcatt attctattta cctcactgaa 2450 

gtactcgcat gttgtttggt acccactgag caactgtttc agttcctaag gtatttgctg 2510 

agatgtgggt gaactccaaa tggagaagta gtcactgtag actttcttca tggttgacca 2570 

ctccaacctt gctcactttt gcttcttggc catccactca gctgatgttt cctgggaagt 2630 

gctaatttta cctgtttcca aattggaaac acatttctca atcattccgt tctggcaaat 2690 

gggaaacatc catttgcttt gggcacagtg gggatgggct gcaagttctt gcatatcctc 2750 

ccagtgaagc atttatttgc tactatcaga ttttaccact atcaaatata attcaagggc 2810 

agaattaaac gtgagtgtgt gtgtgtgtgt gtgtgtgtgt gctatgcatg ctctaagtct 2870 

gcatgggata tgggaatgga aaagggcaat aagaaattaa tacccttatg cagttgcatt 2930 

taaccttaag aaaaatgtcc ttgggataaa ctccaatgtt taatacattg attttttttc 2990 

taaagaaatg ggttttaaac tttggtatgc atcagaattc cctatagatc tttttgaaaa 3050 

tataggtacc tgggtatcac acatagaact tttaattctg ctggtgtagg ctgttgccca 3110 

aacatctata attttactga gctcttcaag tgattctgat aacacagcct gg 3162 

<210> 8 

<211> 477 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Glu Leu Lys Lys Ser Pro Asp Gly Gly Trp Gly Trp Val lie Val 
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15 10 15 

Phe Val Ser Phe Leu Thr Gin Phe Leu Cys Tyr Gly Ser Pro Leu Ala 
20 25 30 

Val Gly Val Leu Tyr lie Glu Trp Leu Asp Ala Phe Gly Glu Gly Lys 
35 40 45 

Gly Lys Thr Ala Trp Val Gly Ser Leu Ala Ser Gly Val Gly Leu Leu 
50 55 60 

Ala Ser Pro Val Cys Ser Leu Cys Val Ser Ser Phe Gly Ala Arg Pro 
65 70 75 80 

Val Thr lie Phe Ser Gly Phe Met Val Ala Gly Gly Leu Met Leu Ser 
85 90 95 

Ser Phe Ala Pro Asn He Tyr Phe Leu Phe Phe Ser Tyr Gly He Val 
100 105 110 

Val Gly Ser Ser Val Gly Leu Phe He Tyr Ala Ala Leu Gin Arg Met 
115 120 125 

Leu Val Glu Phe Tyr Gly Leu Asp Gly Cys Leu Leu He Val Gly Ala 
130 135 140 

Leu Ala Leu Asn He Leu Ala Cys Gly Ser Leu Met Arg Pro Leu Gin 
145 150 155 160 

Ser Ser Asp Cys Pro Leu Pro Lys Lys He Ala Pro Glu Asp Leu Pro 
165 170 175 

Asp Lys Tyr Ser He Tyr Asn Glu Lys Gly Lys Asn Leu Glu Glu Asn 
180 185 190 

He Asn He Leu Asp Lys Ser Tyr Ser Ser Glu Glu Lys Cys Arg He 
195 200 205 

Thr Leu Ala Asn Gly Asp Trp Lys Gin Asp Ser Leu Leu His Lys Asn 
210 215 220 
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Pro Thr Val Thr His Thr Lya Glu Pro Glu Thr Tyr Lys Lys Lys Val 
225 230 235 240 

Ala Glu Gin Thr Tyr Phe Cys Lys Gin Leu Ala Lys Arg Lys Trp Gin 
245 250 255 

Leu Tyr Lys Asn Tyr Cys Gly Glu Thr Val Ala Leu Phe Lya Asn Lys 
260 265 270 

Val Phe Ser Ala Leu Phe lie Ala lie Leu Leu Phe Asp lie Gly Gly 
275 280 285 

Phe Pro Pro Ser Leu Leu Met Glu Asp Val Ala Arg Ser Ser Asn Val 
290 295 300 

Lys Glu Glu Glu Phe lie Met Pro Leu lie Ser lie He Gly He Met 
305 310 315 320 

Thr Ala Val Gly Lys Leu Leu Leu Gly He Leu Ala Asp Phe Lys Trp 
325 330 335 

He Asn Thr Leu Tyr Leu Tyr Val Ala Thr Leu He He Met Gly Leu 
340 345 350 

Ala Leu Cys Ala He Pro Phe Ala Lys Ser Tyr Val Thr Leu Ala Leu 
355 360 365 

Leu Ser Gly He Leu Gly Phe Leu Thr Gly Asn Trp Ser He Phe Pro 
370 375 380 

Tyr Val Thr Thr Lys Thr Val Gly He Glu Lys Leu Ala His Ala Tyr 
385 390 395 400 

Gly He Leu Met Phe Phe Ala Gly Leu Gly Asn Ser Leu Gly Pro Pro 
405 410 415 

He Val Gly Trp Phe Tyr Asp Trp Thr Gin Thr Tyr Asp He Ala Phe 
420 425 430 
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Tyr Phe Ser Gly Phe Cys Val Leu Leu Gly Gly Phe He Leu Leu Leu 
435 440 445 

Ala Ala Leu Pro Ser Trp Asp Thr Cys Asn Lys Gin Leu Pro Lys Pro 
450 455 460 

Ala Pro Thr Thr Phe Leu Tyr Lys Val Ala Ser Aan Val 
465 470 475 



<210> 9 

<211> 4891 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (89) . . (1150) 

<223> Clone LBFL131 



<400> 9 

acgcctggtt cccgggaaga ctcgccagca ccagggggtg ggggagtgcg agctgaaagc 



60 



tgctggagag tgagcagccc tagcaggg atg gac atg atg ctg ttg gtg cag 112 

Met Asp Met Met Leu Leu Val Gin 
1 5 

ggt get tgt tgc teg aac cag tgg ctg gcg gcg gtg etc etc age ctg 160 
Gly Ala Cys Cys Ser Asn Gin Trp Leu Ala Ala Val Leu Leu Ser Leu 
10 15 20 

tgc tgc ctg eta ecc tec tgc etc ecg get gga cag agt gtg gac tte 208 
Cys Cys Leu Leu Pro Ser Cys Leu Pro Ala Gly Gin Ser Val Asp Phe 
25 30 35 40 

ccc tgg gcg gee gtg gac aac atg atg gtc aga aaa ggg gac acg gcg 256 
Pro Trp Ala Ala Val Asp Asn Met Met Val Arg Lys Gly Asp Thr Ala 
45 50 55 
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gtg ctt agg tgt tat ttg gaa gat gga get tea aag ggt gcc tgg ctg 304 
Val Leu Arg Cya Tyr Leu Glu Asp Gly Ala Ser Lys Gly Ala Trp Leu 
60 €5 70 

aac egg tea agt att att ttt gcg gga ggt gat aag tgg tea gtg gat 352 
Asn Arg Ser Ser lie He Phe Ala Gly Gly Asp Lys Trp Ser Val Asp 
75 80 85 

cet ega gtt tea att tea aea ttg aat aaa agg gac tae age ete eag 400 
Pro Arg Val Ser He Ser Thr Leu Asn Lys Arg Asp Tyr Ser Leu Gin 
90 95 100 

ata eag aat gta gat gtg aea gat gat gge eea tae aeg tgt tet gtt 448 
He Gin Asn Val Asp Val Thr Asp Asp Gly Pro Tyr Thr Cys Ser Val 
105 110 115 120 . 

cag act caa cat aea ccc aga aca atg cag gtg cat eta act gtg caa 496 
Gin Thr Gin His Thr Pro Arg Thr Met Gin Val His Leu Thr Val Gin 
125 130 135 

gtt cet ect aag ata tat gac ate tea aat gat atg ace gtc aat gaa 544 
Val Pro Pro Lys He Tyr Asp He Ser T^n Asp Met Thr Val Asn Glu 
140 145 150 

gga ace aac gtc act ctt act tgt ttg gcc act ggg aaa eca gag cet 592 
Gly Thr Asn Val Thr Leu Thr Cys Leu Ala Thr Gly Lys Pro Glu Pro 
155 160 165 

tee att tet tgg ega eae ate tec eea tea gcg aaa eea ttt gaa aat 640 
Ser He Ser Trp Arg His He Ser Pro Ser Ala Lys Pro Phe Glu Asn 
170 175 180 

gga caa tat ttg gac att tat gga att aea agg gac eag get ggg gaa 688 
Gly Gin Tyr Leu Asp He Tyr Gly He Thr Arg Asp Gin Ala Gly Glu 
185 190 195 200 

tat gaa tge agt gcg gaa aat gat gtg tea tte eea gat gtg agg aaa 736 
Tyr Glu Cys Ser Ala Glu Asn Asp Val Ser Phe Pro Asp Val Arg Lys 
205 210 215 
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gta aaa gtt gtt gtc aac ttt get cct act att cag gaa att aaa tct 784 
Val Lys Val Val Val Asn Phe Ala Pro Thr lie Gin Glu lie Lys Ser 
220 225 230 

ggc acc gtg acc ccc gga cgc agt ggc ctg ata aga tgt gaa ggt gca 832 
Gly Thr Val Thr Pro Gly Arg Ser Gly Leu He Arg Cys Glu Gly Ala 
235 240 245 

ggt gtg ccg cct cca gcc ttt gaa tgg tac aaa gga gag aag aag etc 880 
Gly Val Pro Pro Pro Ala Phe Glu Trp Tyr Lys Gly Glu Lys Lys Leu 
250 255 260 

ttc aat ggc caa caa gga att att att caa aat ttt age aca aga tec 928 
Phe Asn Gly Gin Gin Gly He He He Gin Asn Phe Ser Thr Arg Ser 
265 270 275 280 

att etc act gtt acc aac gtg aca cag gag cac ttc ggc aat tat act 976 
He Leu Thr Val Thr Asn Val Thr Gin Glu His Phe Gly Asn Tyr Thr 
285 290 295 

tgt gtg get gcc aac aag eta ggc aca acc aat gcg age ctg cct ctt 1024 
Cys Val Ala Ala Asn Lys Leu Gly Thr Thr Asn Ala Ser Leu Pro Leu 
300 305 310 

aac cct cca agt aca gee cag tat gga att ace ggg age get gat gtt 1072 
Asn Pro Pro Ser Thr Ala Gin Tyr Gly He Thr Gly Ser Ala Asp Val 
315 320 325 

ctt ttc tee tge tgg tac ctt gtg ttg aca ctg tec tct ttc ace age 1120 
Leu Phe Ser Cys Trp Tyr Leu Val Leu Thr Leu Ser Ser Phe Thr Ser 
330 335 340 

ata ttc tac ctg aag aat gcc att eta caa taaattcaaa gacceataaa 1170 
He Phe Tyr Leu Lys Asn Ala He Leu Gin 
345 350 

aggcttttaa ggattctctg aaagtgctga tggctggatc caatctggta cagtttgtta 1230 
aaagcagegt gggatataat cageagtgct tacatgggga tgatcgeett ctgtagaatt 1290 
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gctcattatg taaatacttt aattctactc ttttttgatt agctacatta ccttgtgaag 1350 

cagtacacat tgtccttttt ttaagacgtg aaagctctga aattactttt agaggatatt 1410 

aattgtgatt tcatgtttgt aatctacaac ttttcaaaag cattcagtca tggtctgcta 1470 

ggttgcaggc tgtagtttac aaaaacgaat attgcagtga atatgtgatt ctttaaggct 1530 

gcaatacaag cattcagttc cctgtttcaa taagagtcaa tccacattta caaagatgca 1590 

tttttttctt ttttgataaa aaagcaaata atattgcctt cagattattt cttcaaaata 1650 

taacacatat ctagattttt ctgctcgcat gatattcagg tttcaggaat gagccttgta 1710 

atataactgg ctgtgcagct ctgcttctct ttcctgtaag ttcagcatgg gtgtgccttc 1770 

atacaataat atttttctct ttgtctccaa ctaatataaa atgttttgct aaatcttaca 1830 

atttgaaagt aaaaataaac cagagtgatc aagttaaacc atacactatc tctaagtaac 1890 

gaaggagcta ttggactgta aaaatctctt cctgcactga caatggggtt tgagaatttt 1950 

gccccacact aactcagttc ttgtgatgag agacaattta ataacagtat agtaaatata 2010 

ccatatgatt tctttagttg tagctaaatg ttagatccac cgtgggaaat tattcccttt 2070 

aaaatgacag cacagtccac tcaaaggatt gcctagcaat acagcatctt ttcctttcac 2130 

tagtccaagc caaaaatttt aagatgattt gtcagaaagg gcacaaagtc ctatcaccta 2190 

atattacaag agttggtaag cgctcatcat taattttatt ttgtggcagc taagttagta 2250 

tgacagaggc agtgctcctg tggacaggag cattttgcat attttccatc tgaaagtatc 2310 

actcagttga tagtctggaa tgcatgttat atattttaaa acttccaaaa tatattataa 2370 

caaacattct atatcggtat gtagcagacc aatctctaaa atagctaatt cttcaataaa 2430 

atctttctat atagccattt cagtgcaaac aagtaaaatc aaaaaagacc atcctttatt 2490 
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tttccttaca tgatatatgt aagatgcgat caaataaaga caaaacacca gtgatgagaa 2550 

tatcttaaga taagtaatta tcaaattatt gtgaatgtta aattatttct actataaaga 2610 

agcaaaacta catttttgaa ggaaaatgct gttactctaa cattaattta caggaatagt 2670 

ttgatggttt cactctttac taaagaaagg ccatcacctt gaaagccatt ttacaggttt 2730 

gatgaagtta ccaatttcag tacacctaaa tttctacaaa tagtcccctt ttacaagttg 2790 

taacaacaaa gaccctataa taaaattaga tacaagaaat tttgcagtgg ttatacatat 2850 

ttgagatatc tagtatgttg ccctagcagg gatggcttaa aaactgtgat tttttttctt 2910 

caagtaaaac ttagtcccaa agtacatcat aaatcaattt taactagaaa aatgaatctt 2970 

aaatgagggg acataagtat actctttcca caaaatggca ataataaggc ataaagctag 3030 

taaatctact aactgtaata aatgtatgac attattttga ttgatacatt aaaaaagagt 3090 

ttttagaaca aatatggcat ttaactttat tatttatttg cttttaagaa atattctttg 3150 

tggaattgtt gaataaacta taaaatatta ttttgtattg cagctttaaa gtggcacact 3210 

ccataataat ctacctacta gaaatagtgg tgctaccaca aaaaatgtta accatcagta 3270 

ccattgtttg ggagaaagaa acaggtcaag aatgcatatt attcagtgac cgctttccta 3330 

gagttaaaat acctcctctt tgtaaggttt gtaggtaaat tgaggtataa actatggatg 3390 

aaccaaataa ttagttcaaa gtgttgtcat gattccaaat ttgtggagtc tggtgttttt 3450 

accatagaat gtgacagaag tacagtcata gctcagtagc tatatgtatt tgcctttatg 3510 

ttagaagaga ctttcttgag tgacattttt aaatagagga ggtattcact atgtttttct 3570 

gtatcacagc agcattccta gtccttaggc cctcggacag agtgaaatca tgagtattta 3630 

tgagttcaat attgtcaaat aaggctacag tatttgcttt tttgtgtgaa tgtattgcat 3690 
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ataatgttca agtagatgat tttacattta tggacatata aaacgtctga ttaccccatt 3750 

ttatcagtcc tgactgtaca agattgttgc aatttcagaa tagcagtttt ataaattgat 3810 

ttatctttta atctataaca atttgtgtta gctgttcatt tcaggattat attttctaca 3870 

agttccactt gtgggactcc ttttgttgcc cctatttttt tttaaagaag gaagaaagaa 3930 

aaatgagtag cagtttaaaa atgagaatgg agagaaaaga aaaagaatga aaaggaaagg 3990 

cagtaaagag ggaaaaaaaa ggaaggatgg aaggaatgaa ggaaggaagg gaggaagggg 4050 

agaaggtagg aagaaagaaa ggatgagagg gaaggaagaa tcagagtatt agggtagtta 4110 

acttacacat ttgcattctt agttatactg caagtggtgt aactatgttt ttcaatgatc 4170 

gcatttgaaa cataagtcct attataccat taagttccta ttatgcagca attatataat 4230 

aaaaagtact gcccaagtta tagtaatgtg ggtgtttttg agacactaaa agatttgaga 4290 

gggagaattt caaacttaaa gccacttttg gggggtttat aacttaactg aaaaattaat 4350 

gcttcatcat aacatttaag ctatatctag aaagtagact ggagaactga gaaaattacc 4410 

caggtaattc agggaaaaaa aaaaaatata tatatatata aataccccta catttgaagt 4470 

cagaaaactc tgaaaaactg aattatcaaa gtcaatcatc tataatgatc aaatttactg 4530 

aacaattgtt aatttatcca ttgtgcttag ctttgtgaca cagccaaaag ttacctattt 4590 

aatcttttca ataaaaattg ttttttgaaa tccagaaatg atttaaaaag aggtcaggtt 4650 

tttaactatt tattgaagta tgtggatgtc cagtatttca atagatatga atatgaataa 4710 

atggtatgcc ttaagattct ttgaatatgt atttacttta aagactggaa aaagctcttc 4770 

ctgtctttta gtaaacatcc atatttcata acctgatgta aaatatgttg tactgtttcc 4830 

aataggggaa tataaactca gtttatcaat taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4890 
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a 4891 



<210> 10 

<211> 354 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asp Met Met Leu Leu Val Gin Gly Ala Cys Cys Ser Asn Gin Trp 
15 10 15 

Leu 70.a Ala Val Leu Leu Ser Leu Cys Cys Leu Leu Pro Ser Cys Leu 
20 25 30 

Pro Ala Gly Gin Ser Val Asp Phe Pro Trp Ala Ala Val Asp Asn Met 
35 40 45 

Met Val Arg Lys Gly Asp Thr Ala Val Leu Arg Cys Tyr Leu Glu Asp 
50 55 60 

Gly Ala Ser Lys Gly Ala Trp Leu Asn Arg Ser Ser lie lie Phe Ala 
65 70 75 80 

Gly Gly Asp Lys Trp Ser Val Asp Pro Arg Val Ser lie Ser Thr Leu 
85 90 95 

Asn Lys Arg Asp Tyr Ser Leu Gin lie Gin Asn Val Asp Val Thr Asp 
100 105 110 

Asp Gly Pro Tyr Thr Cys Ser Val Gin Thr Gin His Thr Pro Arg Thr 
115 120 125 

Met Gin Val His Leu Thr Val Gin Val Pro Pro Lys lie Tyr Asp lie 
130 135 140 

Ser Asn Asp Met Thr Val Asn Glu Gly Thr Asn Val Thr Leu Thr Cys 
145 150 155 160 

Cieu Ala Thr Gly Lys Pro Glu Pro Ser lie Ser Trp Arg His lie Ser 
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165 170 175 

Pro Ser Ala Lys Pro Phe Glu Asn Gly Gin Tyr Leu Asp lie Tyr Gly 
180 185 190 

lie Thr Arg Asp Gin Ala Gly Glu Tyr Glu Cys Ser Ala Glu Asn Asp 
195 200 205 

Val Ser Phe Pro Asp Val Arg Lys Val Lys Val Val Val Asn Phe Ala 
210 215 220 

Pro Thr lie Gin Glu lie Lys Ser Gly Thr Val Thr Pro Gly Arg Ser 
225 230 235 240 

Gly Leu lie Arg Cys Glu Gly Ala Gly Val Pro Pro Pro Ala Phe Glu 
245 250 255 

Trp Tyr Lys Gly Glu Lys Lys Leu Phe Asn Gly Gin Gin Gly lie He 
260 265 270 

He Gin Asn Phe Ser Thr Arg Ser He Leu Thr Val Thr Asn Val Thr 
275 280 285 

Gin Glu His Phe Gly Asn Tyr Thr Cys Val Ala Ala Asn Lys Leu Gly 
290 295 300 

rhr Thr Asn Ala Ser Leu Pro Leu Asn Pro Pro Ser Thr Ala Gin Tyr 
305 310 315 320 

Sly He Thr Gly Ser Ala Asp Val Leu Phe Ser Cys Trp Tyr Leu Val 
325 330 335 

Leu Thr Leu Ser Ser Phe Thr Ser He Phe Tyr Leu Lys Asn Ala He 
340 345 350 

jeu Gin 



:210> 11 
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<211> 3098 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> ' CDS 

<222> (223) . . (1569) 

<223> IiBFL164 

<400> 11 

gtttatcgcg cacatctcgc ggcgaggagg agaggccgga agggcgcccc agccccaagg 60 

ctcctgcccc gcctgggcct ccggctttcg tttccccgca acgcttcgct ttcgtttccc 120 

gctggcgcct ggctccctcc gggtttcgtt tcccgccggc gcctggctcc cgccaggttt 180 

cgtttccgag gcggggccga gggcggcgtc gctgaggcgc cc atg gcc ttc 231 

Met Ala Phe 
1 

gcc cgc egg etc ctg cgc ggg cca ctg teg ggg ccg ctg etc ggg egg 279 
Ala Arg Arg Leu Leu Arg Gly Pro Leu Ser Gly Pro Leu Leu Gly Arg 
5 10 15 

cgc ggg gtc tge get ggg gee atg get ceg eeg cgc cgc ttc gtc ctg 327 
Arg Gly Val Cys Ala Gly Ala Met Ala Pro Pro Arg Arg Phe Val Leu 
20 25 30 35 

gag ett ecc gac tge ace ctg get eac ttc gee eta ggc gee gae gee 375 
Glu Leu Pro Asp Cys Thr Leu Ala His Phe Ala Leu Gly Ala Asp Ala 
40 45 50 

ecc ggc gac gea gac gcc ecc gae ecc cgc ctg gcg gcg ctg ctg ggg 423 
Pro Gly Asp Ala Asp Ala Pro Asp Pro Arg Leu Ala Ala Leu Leu Gly 
55 60 65 

ecc ccg gag cgc age tac teg ctg tge gtg ecc gtg ace ccg gac gcc 471 
Pro Pro Glu Arg Ser Tyr Ser Leu Cys Val Pro Val Thr Pro Asp Ala 
70 75 80 
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ggc tgc ggg gcc egg gtc egg gcg gcg egg ctg cac cag cgc ctg ctg 519 

Gly Cys Gly Ala Arg Val Arg Ala Ala Arg Leu His Gin Arg Leu Leu 

85 90 95 

cac cag ctg cgc cgc ggc ccc ttc cag egg tgc cag ctg etc agg ctg 567 

His Gin Leu Arg Arg Gly Pro Phe Gin Arg Cys Gin Leu Leu Arg Leu 

100 105 110 115 

etc tgc tac tgc ccg ggc ggc cag gcc ggc ggc gca cag caa ggc ttc 615 

Leu Cys Tyr Cys Pro Gly Gly Gin Ala Gly Gly Ala Gin Gin Gly Phe 

120 125 130 

ctg ctg cgc gac ccc ctg gat gac cct gac acc egg caa gcg ctg etc 663 

Leu Leu Arg Asp Pro Leu Asp Asp Pro Asp Thr Arg Gin Ala Leu Leu 

135 140 145 

gag ctg ctg ggc gcc tgc cag gag gca cca cgc ccg cac ttg ggc gag 711 

Glu Leu Leu Gly Ala Cys Gin Glu Ala Pro Arg Pro His Leu Gly Glu 

150 155 160 

ttc gag gcc gac ccg cgc ggc cag ctg tgg cag cgc etc tgg gag gtg 759 

Phe Glu Ala Asp Pro Arg Gly Gin Leu Trp Gin Arg Leu Trp Glu Val 

165 170 175 

caa gac ggc agg egg ctg cag gtg ggc tgc gca cag gtc gtg ccc gtc 807 
Gin Asp Gly Arg Arg Leu Gin Val Gly Cys Ala Gin Val Val Pro Val 

180 185 190 195 

ccg gag ccc ccg ctg cac ccg gtg gtg cca gac ttg ccc agt tec gtg 855 

Pro Glu Pro Pro Leu His Pro Val Val Pro Asp Leu Pro Ser Ser Val 

200 205 210 

gtc ttc ccg gac egg gaa gcc gcc egg gee gtt ttg gag gag tgt ace 903 

Val Phe Pro Asp Arg Glu Ala Ala Arg Ala Val Leu Glu Glu Cys Thr 

215 220 225 

tec ttt att cct gaa gcc egg gca gtg ett gac ctg gtc gac cag tgc 951 

Ser Phe lie Pro Glu Ala Arg Ala Val Leu Asp Leu Val Asp Gin Cys 

230 235 240 
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cca aaa cag ate cag aaa gga aag ttc cag gtt gtt gcc ate gaa gga 
Pro Lys 61n lie 6Xn Lys Gly Lys Phe Gin Val Val /Via Xle Glu Gly 
245 250 255 



999 



ctg gat gcc acg ggt aaa acc acg gtg ace cag tea gtg gca gat tea 
Leu Asp Ala Thr Gly Lys Thr Thr Val Thr Gin Ser Val Ala Asp Ser 
260 265 270 275 



1047 



ett aag get gtc etc tta aag tea cca ccc tct tgc att ggc cag tgg 
Leu Lys Ala Val Leu Leu Lys Ser Pro Pro Ser Cys lie Gly Gin Trp 
280 285 290 



1095 



agg aag ate ttt gat gat gaa cca act ate att aga aga get ttt tac 
Arg Lys He Phe Asp Asp Glu Pro Thr He He Arg Arg Ala Phe Tyr 
295 300 305 



1143 



tct ttg ggc aat tat att gtg gcc tec gaa ata get aaa gaa tct gcc 
Ser Leu Gly Asn Tyr He Val Ala Ser Glu He Ala Lys Glu Ser Ala 
310 315 320 



1191 



aaa tet eet gtg att gta gac agg tac tgg cae age acg gcc acc tat 
Lys Ser Pro Val He Val Asp Arg Tyr Trp His Ser Thr Ala Thr Tyr 
325 330 335 



1239 



get ata gee act gag gtg agt ggg ggt etc cag cae etg eee eea gcc 
Ala He Ala Thr Glu Val Ser Gly Gly Leu Gin His Leu Pro Pro Ala 
340 345 350 355 



1287 



eat cae eet gtg tac eag tgg cca gag gac etg etc aaa eet gae ett 
His His Pro Val Tyr Gin Trp Pro Glu Asp Leu Leu Lys Pro Asp Leu 
360 365 370 



1335 



ate etg etg etc act gtg agt eet gag gag agg ttg cag agg etg cag 
He Leu Leu Leu Thr Val Ser Pro Glu Glu Arg Leu Gin Arg Leu Gin 
375 380 385 



1383 



ggc egg ggc atg gag aag ace agg gaa gaa gca gaa ett gag gee aac 
Gly Arg Gly Met Glu Lys Thr Arg Glu Glu Ala Glu Leu Glu Ala Asn 
390 395 400 



1431 
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agt gtg ttt cgt caa aag gta gaa atg tec tac cag egg atg gag aat 1479 
Ser Val Phe Arg Gin Lys Val Glu Met Ser Tyr Gin Arg Met Glu Asn 
405 410 415 

cct ggc tgc cat gtg gtt gat gcc age ccc tec aga gaa agg gte etg 1527 
Pro Gly Cys His Val Val Asp Ala Ser Pro Ser Arg Glu hxg Val Leu 
420 425 430 435 

eag aeg gta tta age eta ate eag aat agt ttt agt gaa eeg t 1570 

Gin Thr Val Leu Ser Leu lie Gin Asn Ser Phe Ser Glu Pro 
440 445 

agttaetetg geeaggtgce aegtetaaet agattagatg ttgtttgaaa eatctaeate 1630 

eaceatttgt tatgeagtgt teccaaattt ctgttctaea ageatgttgt gtggeagaaa 1690 

aetggagace aggeatctta attttactte ageeatcgta ecetcttctg actgatggac 1750 

ecgteateae aaaggtccet cteateatgt teeagtgaga ^ggecagegat tgetttette 1810 

etggeatagt aaaeatttte ttggaacata tgtttcaett aatcaetaee aaatatctgg 1870 

aagaeetgte ttacteagae ageaeeaggt gtaeagaage ageagacaag atetteeaga 1930 

tcageaggga gaceeeggag cctetgcttc tcetaeaetg geatgetgat gagategtga 1990 

eatgeeeaea ttggettett eeaeatctgg ttgeaetegt eatgatggge tegetgeatc 2050 

teeeteagtc eeaaattcta gagecaagtg ttcctgcaga ggctgtctat gtgtcetgge 2110 

tgcccaagga caetcctgca gagecatttt tgggtaagga acaettaeaa agaaggcatt 2170 

gatettgtgt etgaggetca gageeetttt gataggettc tgagteatat ataaagacat 2230 

tcaagccaag atgetecaac tgcaaatata ccaaccttet etgaattata ttttgettat 2290 

ttatatttet tttctttttt tetaaagtat ggetetgaat agaatgeaca tttteeattg 2350 

aactggatgc atttcattta gccaatccag taatttattt atattaatet atacaatatg 2410 
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tttcctcagc ataggagcta tgattcatta attaaaagtg gagtcaaaac gctaaatgca 2470 

atgtttijttg tgtattttca ttacacaaac ttaatttgtc ttgttaaata agtacagtgg 2530 

atcttggagt gggatttctt ggtaaattat cttgcacttg aatgtctcat gattacatat 2590 

gaaatcgctt tgacatatct ttagacagaa aaaagtagct gagtgagggg gaaattatag 2650 

agctgtgtga ctttagggag taggttgaac caggtgatta cctaaaattc cttccagttc 2710 

aaaggcagat aaatctgtaa attattttat cctatctacc atttcttaag aagacattac 2770 

tccaaaataa ttaaatttaa ggctttatca ggtctgcata tagaatctta aattctaata 2830 

aagtttcatg ttaatgtcat aggattttta aaagagctat aggtaatttc tatataatat 2890 

gtgtatatta aaatgtaatt gatttcagtt gaaagtattt taaagctgat aaatagcatt 2950 

agggttcttt gcaatgtggt atctagctgt attattggtt ttatttactt taaacatttt 3010 

gaaaagctta tactggcagc ctagaaaaac aaacaattaa tgtatcttta tgtccctggc 3070 

acatgaataa actttgctgt ggtttact 3098 



<210> 12 

<211> 449 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Ala Phe Ala Arg Arg Leu Leu Arg Gly Pro Leu Ser Gly Pro Leu 
15 10 15 

Leu Gly Arg Arg Gly Val Cys Ala Gly Ala Met Ala Pro Pro Arg Arg 
20 25 30 

Phe Val Leu Glu Leu Pro Asp Cys Thr Leu Ala His Phe Ala Leu Gly 
35 40 45 
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Ala Asp Ala Pro Gly Asp Ala Asp Ala Pro Asp Pro 7U:g Leu Ala Ala 
50 55 60 

Leu Leu Gly Pro Pro Glu Arg Ser Tyr Ser Leu Cys Val Pro Val Thr 
65 70 75 80 

Pro Asp Ala Gly Cys Gly Ala Arg Val Arg Ala Ala Arg Leu His Gin 
85 90 95 

Arg Leu Leu His Gin Leu Arg Arg Gly Pro Phe Gin Arg Cys Gin Leu 
100 105 110 

Leu Arg Leu Leu Cys Tyr Cys Pro Gly Gly Gin Ala Gly Gly Ala Gin 
115 120 125 

Gin Gly Phe Leu Leu Arg Asp Pro Leu Asp Asp Pro Asp Thr Arg Gin 
130 135 140 

Ala Leu Leu Glu Leu Leu Gly Ala Cys Gin Glu Ala Pro Arg Pro His 
145 150 155 160 

Leu Gly Glu Phe Glu Ala Asp Pro Arg Gly Gin Leu Trp Gin Arg Leu 
165 170 175 

Trp Glu Val Gin Asp Gly Arg Arg Leu Gin Val Gly Cys Ala Gin Val 
180 185 190 

Val Pro Val Pro Glu Pro Pro Leu His Pro Val Val Pro Asp Leu Pro 
195 200 205 

Ser Ser Val Val Phe Pro Asp Arg Glu Ala Ala Arg Ala Val Leu Glu 
210 215 220 

Glu Cys Thr Ser Phe lie Pro Glu Ala Arg Ala Val Leu Asp Leu Val 
225 230 235 240 

Asp Gin Cys Pro Lys Gin He Gin Lys Gly Lys Phe Gin Val Val Ala 
245 250 255 
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He Glu Gly Leu Asp Ala Thr Gly Lys Thr Thr Val Thr Gin Ser Val 
260 265 270 

Ala Asp Ser Leu Lys Ala Val Leu Leu Lys Ser Pro Pro Ser Cys He 
275 280 285 

Gly Gin Trp Arg Lys He Phe Asp Asp Glu Pro Thr He He Arg Arg 
290 295 300 

Ala Phe Tyr Ser Leu Gly Asn Tyr He Val Ala Ser Glu He Ala Lys 
305 310 315 320 

Glu Ser Ala Lys Ser Pro Val He Val Asp Arg Tyr Trp His Ser Thr 
325 330 335 

Ala Thr Tyr Ala He Ala Thr Glu Val Ser Gly Gly Leu Gin His Leu 
340 345 . 350 

Pro Pro Ala His His Pro Val Tyr Gin Trp Pro Glu Asp Leu Leu Lys 
355 360 365 

Pro Asp Leu He Leu Leu Leu Thr Val Ser Pro Glu Glu Arg Leu Gin 
370 375 380 

Arg Leu Gin Gly Arg Gly Met Glu Lys Thr Arg Glu Glu Ala Glu Leu 
385 390 395 400 

Glu Ala Asn Ser Val Phe Arg Gin Lys Val Glu Met Ser Tyr Gin Arg 
405 410 415 

Met Glu Asn Pro Gly Cys His Val Val Asp Ala Ser Pro Ser Arg Glu 
420 425 430 

Arg Val Leu Gin Thr Val Leu Ser Leu He Gin Asn Ser Phe Ser Glu 
435 440 445 



Pro 
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<210> 13 

<211> 1893 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (418) (1392) 

<223> LBElil67 Clone #20 

<400> 13 

agtccagctg ccgttaggcg ctgggatagt cgcacgctgg atgcatctac gtccgccgag 60 

cccctggggc gaagaggccg cgtccgcctt catttgtggc cggtgcttcg ccccctgacc 120 

cttcgccccc aaagaccagc tctaacgtga gcgcctcggc cgccctgccc cagcctcgta 180 

cacgccgcca gcctcgccca gccggtgtcc ggagaccctc gggccgtgtc catttgtggg 240 

caaagccagc ggggcaggct tggccagagt gcaccactcg gcgccgtccc aggcccgacg 300 

ctctgggcgc gcccggaacc ccaggttcgc ggcccgtgtt tccgaccggc ggagggggct 360 

cagcggcccg atcccacgga agcgcgctcg gaggggtggg acccggccgg accggag 417 

atg gcg ccg cca gcg ggc ggg gcg gcg gcg gcg gcc teg gac ttg ggc 465 
Met Ala Pro Fro Ala Gly Gly Ala Ala Ala Ala Ala Ser Asp Leu Gly 
15 10 15 

tec gcc gca gtg etc ttg get gtg cac gcc gcg gtg agg ccg ctg ggc 513 
Ser Ala Ala Val Leu Leu Ala Val His Ala Ala Val Arg Pro Leu Gly 
20 25 30 

gcc ggg cca gac gcc gag gca cag ctg egg agg ctg cag ctg age gcg 561 
Ala Gly Pro Asp Ala Glu Ala Gin Leu Arg Arg Leu Gin Leu Ser Ala 
35 40 45 

gac cct gag egg ect ggg cgc ttc egg ctg gag ctg ctg ggc gcg gga 609 
Asp Pro Glu Arg Pro Gly Arg Phe Arg Leu Glu Leu Leu Gly Ala Gly 
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50 



55 



60 



cct ggg gcg gtt aat ttg gag tgg ccc ctg gag tea gtt tec tae acc 
Pro Gly Ala Val Asn Leu Glu Trp Pro Leu Glu Ser Val Ser Tyr Thr 
65 70 75 80 



657 



ate cga gge ccc acc cag cac gag eta cag cct cca cca gga ggg cct 
He Arg Gly Pro Thr Gin His Glu Leu Gin Pro Pro Pro Gly Gly Pro 
85 90 95 



705 



gga acc etc age ctg cac tte etc aae cct cag gaa get cag egg tgg 
Gly Thr Leu Ser Leu His Phe Leu Asn Pro Gin Glu /Via Gin Arg Trp 
100 105 110 



753 



gea gte eta gte cga ggt gee acc gtg gaa gga cag aat gge age aag 
Ma Val Leu Val Arg Gly Ala Thr Val Glu Gly Gin Asn Gly Ser Lys 
115 120 125 



801 



age aae tea cca cca gee ttg gge cca gaa gea tgc cct gte tec ctg 
Ser Asn Ser Pro Pro Ala Leu Gly Pro Glu Ala Cys Pro Val Ser Leu 
130 135 140 



849 



2CC agt ccc ccg gaa gee tec aca etc aag gge cct cca cct gag gea 
?ro Ser Pro Pro Glu Ala Ser Thr Leu Lys Gly Pro Pro Pro Glu Ala 
L45 150 155 160 



897 



jat ett cct agg age cct gga aae ttg aeg gag aga gaa gag ctg gea 
isp Leu Pro Arg Ser Pro Gly Asn Leu Thr Glu Arg Glu Glu Leu Ala 
165 170 175 



945 



rgg age ctg gee egg get att gea ggt gga gae gag aag ggg gea gee 
;iy Ser Leu Ala TUrg Ala He Ala Gly Gly Asp Glu Lys Gly Ala Ala 
180 185 190 



993 



:aa gtg gea gee gte ctg gee cag cat cgt gtg gee ctg agt gtt cag 
Jin Val Ala Ala Val Leu Ala Gin His Arg Val Ala Leu Ser Val Gin 
195 200 205 



1041 



tt cag gag gee tgc ttc cca cct gge ccc ate agg ctg cag gte aca 
eu Gin Glu Ala Cys Phe Pro Pro Gly Pro He Arg Leu Gin Val Thr 



1089 
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210 215 220 

ctt gaa gac get gcc tot gcc gca tec gee geg tec tct gca cac gtt 1137 
Leu Glu Asp Ala Ala Ser Ala Ala Ser Ala Ala Ser Ser Ala His Val 
225 230 235 240 

gee ctg cag gtc cac cec cac tgc act gtt gca get etc eag gag cag 1185 
Ala Leu Gin Val His Pro His Cys Thr Val Ala Ala Leu Gin Glu Gin 
245 250 255 

gtg ttc tea gag etc ggt ttc ccg cca gcc gtg caa cgc tgg gtc ate 1233 
Val Phe Ser Glu Leu Gly Phe Pro Pro Ala Val Gin Arg Trp Val lie 
260 265 270 

gga egg tgc ctg tgt gtg ect gag cgc age ctt gcc tct tac ggg gtt 1281 
Gly Arg Cys Leu Cys Val Pro Glu Arg Ser Leu Ala Ser Tyr Gly Val 
275 280 285 

egg cag gat ggg gac ect get ttc etc tac ttg ctg tea get ect cga 1329 
?Vrg Gin Asp Gly Asp Pro Ala Phe Leu Tyr Leu Leu Ser Ala Pro Arg 
290 295 300 

jaa gee cca ggt eag tee teg atg ggg gtg ggg tgt ggg agg tgg ggt 1377 
31u Ala Pro Gly Gin Ser Ser Met Gly Val Gly Cys Gly Arg Trp Gly 
305 310 315 320 

jea gcc eea cag tec tgagctec accccctcag ccacaggace tagcectcag 1430 
\la Ala Pro Gin Ser 
325 

:acccccaga agatggaegg ggaacttgga cgettgtttc ceccatcatt ggggctaccc 1490 

:eaggceecc agecagetgc ctecagectg cecagtccac teeagcccag ctggtcctgt 1550 

:cttcctgca cettcatcaa tgccccagac cgccctggct gtgagatgtg tagcacccag 1610 

.ggccctgca cttgggaccc ccttgctgca gcttccacct agcagccacc agaggtacca 1670 

aggtggcac aggcagggga ggtggggggc cagggeagaa tccacaggaa tgaeeeaget 1730 
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cctcccccac aggttacaag gggagagtgg cccttccctc acaagtccga catctccagg 1790 
cccccactga actccgggga cctctactga ctgcttgctg ggacagtcac cagggttggg 1850 
gggaagggcc acaaaatgaa accattaaag acccttaaga gcc 1893 

<210> 14 

<211> 325 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Met Ala Pro Pro Ala Gly Gly Ala Ala Ala Ala Ala Ser Asp Leu Gly 
15 10 15 

Ser Ala Ala Val Leu Leu Ala Val His Ala Ala Val Arg Pro Leu Gly 
20 25 30 

Ala Gly Pro Asp Ala Glu Ala Gin Leu Arg Arg Leu Gin Leu Ser Ala 
35 40 45 

Asp Pro Glu Arg Pro Gly Arg Phe Arg Leu Glu Leu Leu Gly Ala Gly 
50 55 60 

Pro Gly Ala Val Asn Leu Glu Trp Pro Leu Glu Ser Val Ser Tyr Thr 
65 70 75 80 

Cle Arg Gly Pro Thr Gin His Glu Leu Gin Pro Pro Pro Gly Gly Pro 
85 90 95 

;iy Thr Leu Ser Leu His Phe Leu Asn Pro Gin Glu Ala Gin TVrg Trp 
100 105 110 

aa Val Leu Val Arg Gly Ala Thr Val Glu Gly Gin Asn Gly Ser Lys 
115 120 125 

Jer Asn Ser Pro Pro Ala Leu Gly Pro Glu Ala Cys Pro Val Ser Leu 
130 135 140 
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Pro Ser Pro Pro Glu Ala Ser Thr Leu Lys Gly Pro Pro Pro Glu Ala 
145 150 155 160 

Asp Leu Pro Arg Ser Pro Gly Asn Leu Thr Glu Arg Glu Glu Leu Ala 
165 170 175 

Gly ser Leu Ala Arg Ala He Ala Gly Gly Asp Glu Lys Gly Ala Ala 
180 185 190 

Gin Val Ala Ala Val Leu Ala Gin His Arg Val Ala Leu Ser Val Gin 
195 200 205 

Leu Gin Glu Ala Cys Phe Pro Pro Gly Pro He Arg Leu Gin Val Thr 
210 215 220 

Leu Glu Asp Ala Ala Ser Ala Ala Ser Ala Ala Ser Ser Ala His Val 
225 230 235 240 

Ala Leu Gin Val His Pro His Cys Thr Val Ala Ala Leu Gin Glu Gin 
245 250 255 

Val Phe Ser Glu Leu Gly Phe Pro Pro Ala Val Gin Arg Trp Val He 
260 265 270 

Gly Arg Cys Leu Cys Val Pro Glu Arg Ser Leu Ala Ser Tyr Gly Val 
275 280 285 

Arg Gin Asp Gly Asp Pro Ala Phe Leu Tyr Leu Leu Ser Ala Pro Arg 
290 295 300 

Glu Ala Pro Gly Gin Ser Ser Met Gly Val Gly Cys Gly Arg Trp Gly 
305 310 315 320 

Ala Ala Pro Gin Ser 
325 



<210> 15 
<211> 1597 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> CDS 

<222> (271) . . (1431) 

<223> liBFL167 Clone #46 

<400> 15 

gtgagcgcct cggccgccct gccccagcct cgtacacgcc gccagctcgc ccagccggtg 60 

tccggagacc ctcgggccgt gtccatttgt gggcaaagcc agcggggcag gcttggccag 120 

agtgcaccac tcggcgccgt cccaggcccg acgctctggg cgcgcccgga accccaggtt 180 

cgcggcccgt gtttccgacc ggcggagggg gctcagcggc ccgatcccac ggaagcgcgc 240 

tcggaggggt gggacccggc cggaccggag atg gcg ccg cca gcg ggc ggg gcg 294 

Met Ala Pro Pro Ala Gly Gly Ala 
1 5 

gcg gcg gcg gcc teg gac ttg ggc tec gcc gca gtg etc ttg get gtg 342 
Ala Ala Ala Ala Ser Asp Leu Gly Ser Ala Ala Val Leu Leu Ala Val 
10 15 20 

cac gcc gcg gtg agg ccg ctg ggc gcc ggg cca gac gcc gag gca cag 390 
His Ala Ala Val Arg Pro Leu Gly Ala Gly Pro Asp Ala Glu Ala Gin 
25 30 35 40 

ctg egg agg ctg cag ctg age gcg gac cct gag agg ect ggg cgc ttc 438 
Leu Arg Arg Leu Gin Leu Ser Ala Asp Pro Glu Arg Pro Gly Arg Phe 
45 50 55 

3gg ctg gag ctg ctg ggc gcg gga cct ggg gcg gtt aat ttg gag tgg 486 
Arg Leu Glu Leu Leu Gly Ala Gly Pro Gly Ala Val Asn Leu Glu Trp 
60 65 70 

zee ctg gag tea gtt tec tac ace ate cga ggc ecc ace cag cac gag 534 
Pro Leu Glu Ser Val Ser Tyr Thr He Arg Gly Pro Thr Gin His Glu 
75 80 85 
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eta cag cct cca cca gga ggg cct gga acc etc age etg eae tte ete 
Leu Gin Pro Pro Pro Gly Gly Pro Gly Thr Leu Ser Leu His Phe Leu 
90 95 100 



582 



aae eet eag gaa get cag egg tgg gca gtc eta gte cga ggt gee acc 
Asn Pro Gin Glu Ala Gin Arg Trp Ala Val Leu Val Arg Gly Ala Thr 
105 110 115 120 



630 



gtg gaa gga cag aat ggc age aag age aac tea cca cca gee ttg gge 
Val Glu Gly Gin Asn Gly Ser Lys Ser Asn Ser Pro Pro Ala Leu Gly 
125 130 135 



678 



cca gaa gca tgc eet gte tec etg cec agt ccc ccg gaa gee tec aea 
Pro Glu Ala Cys Pro Val Ser Leu Pro Ser Pro Pro Glu Ala Ser Thr 
140 145 150 



726 



=tc aag gge eet cca cct gag gca gat ett eet agg age eet gga aae 
Leu Lys Gly Pro Pro Pro Glu Ala Asp Leu Pro Arg Ser Pro Gly Asn 
155 160 165 



774 



:tg acg gag aga gaa gag etg gca ggg age etg gee egg get att gca 
jeu Thr Glu Arg Glu Glu Leu Ala Gly Ser Leu Ala Arg Ala He Ala 
170 175 180 



822 



rgt gga gae gag aag ggg gca gee caa gtg gca gcc gtc etg gee cag 
;iy Gly Asp Glu Lys Gly Ala Ala Gin Val Ala Ala Val Leu Ala Gin 
-85 190 195 200 



870 



:at cgt gtg gcc etg agt gtt cag ctt cag gag gcc tgc ttc cca cct 
:is Arg Val Ala Leu Ser Val Gin Leu Gin Glu Ala Cys Phe Pro Pro 
205 210 215 



918 



gc ccc ate agg etg cag gtc aca ctt gaa gae get gee tet gee gca 
ly Pro He Arg Leu Gin Val Thr Leu Glu Asp Ala Ala Ser Ala Ala 
220 225 230 



966 



cc gcc gcg tec tet gca eae gtt gee etg cag gte eae ccc eae tgc 
er Ala Ala Ser Ser Ala His Val Ala Leu Gin Val His Pro His Cys 
235 240 245 



1014 
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act gtt gca get etc cag gag cag gtg ttc tea gag etc ggt ttc ccg 
Thr Val Ala Ala Leu Gin Glu Gin Val Phe Ser Glu Leu Gly Phe Pro 
250 255 260 



1062 



cca gee gtg caa cgc tgg gtc ate gga egg tgc ctg tgt gtg cct gag 
Pro Ala Val Gin Arg Trp Val lie Gly Arg Cys Leu Cys Val Pro Glu 
265 270 275 280 



1110 



cgc age ctt gee tct tac ggg gtt egg cag gat ggg gae cct get ttc 
Arg Ser Leu Ala Ser Tyr Gly Val hxg Gin Asp Gly Asp Pro TUa Phe 
285 290 295 

Dtc tac ttg ctg tea get cct cga gaa gee cca gee aca gga cct age 
Ceu Tyr Leu Leu Ser Ala Pro Arg Glu Ala Pro Ala Thr Gly Pro Ser 
300 305 310 



1158 



1206 



:ct cag cac cec cag aag atg gae ggg gaa ctt gga cgc ttg ttt ccc 1254 
?ro Gin His Pro Gin Lys Met Asp Gly Glu Leu Gly Arg Leu Phe Pro 
315 320 325 

:ca tea ttg ggg eta ccc cca ggc ccc cag cca get gee tec age ctg 1302 
>ro Ser Leu Gly Leu Pro Pro Gly Pro Gin Pro Ala Ala Ser Ser Leu 
330 335 340 

:ec agt cca etc cag cec age tgg tec tgt cct tee tgc ace ttc ate 1350 
ro Ser Pro Leu Gin Pro Ser Trp Ser Cys Pro Ser Cys Thr Phe He 
350 355 360 

at gee cca gae ege eet gge tgt gag atg tgt age ace cag agg ccc 1398 
sn Ala Pro Asp Arg Pro Gly Cys Glu Met Cys Ser Thr Gin Arg Pro 
365 370 375 

gic act tgg gae ece ett get gea get tee ace tagcageca ceagaggtta 1450 
/s Thr Trp Asp Pro Leu Ala Ala Ala Ser Thr 
380 385 



iaggggaga gtggccctte cetcacaagt cegacatcte caggceccca etgaaetecg 
rgacetcta etgaetgett getgggacag tcaceagggt tggggggaag ggecaeaaaa 



1510 



1570 
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tgaaaccatt aaagaccctt aagagcc 1597 

<210> 16 

<211> 387 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Ala Pro Pro Ala Gly Gly Ala Ala Ala Ala Ala Ser Asp Leu Gly 
15 10 15 

Ser Ala Ala Val Leu Leu Ala Val His Ala Ala Val Arg Pro Leu Gly 
20 25 30 

Ala Gly Pro Asp Ala Glu Ala Gin Leu Arg Arg Leu Gin Leu Ser Ala 
35 40 45 

Asp Pro Glu Arg Pro Gly Arg Phe Arg Leu Glu Leu Leu Gly Ala Gly 
50 55 60 

Pro Gly Ala Val Asn Leu Glu Trp Pro Leu Glu Ser Val Ser Tyr Thr 
65 70 75 80 

He Arg Gly Pro Thr Gin His Glu Leu Gin Pro Pro Pro Gly Gly Pro 
85 90 95 

Sly Thr Leu Ser Leu His Phe Leu Asn Pro Gin Glu Ala Gin Arg Trp 
100 105 110 

Ua Val Leu Val Arg Gly Ala Thr Val Glu Gly Gin Asn Gly Ser Lys 
115 120 125 

3er Asn Ser Pro Pro Ala Leu Gly Pro Glu Ala Cys Pro Val Ser Leu 
130 135 140 

?ro Ser Pro Pro Glu Ala Ser Thr Leu Lys Gly Pro Pro Pro Glu Ala 
.45 150 155 160 
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Asp Leu Pro Arg Ser Pro Gly Asn Leu Thr Glu Arg Glu Glu Leu Ala 
165 170 175 

Gly Ser Leu Ala Arg Ala He Ala Gly Gly Asp Glu Lys Gly Ala Ala 
180 185 190 

Gin val Ala Ala Val Leu Ala Gin His Arg Val Ala Leu Ser Val Gin 
195 200 205 

Leu Gin Glu Ala Cys Phe Pro Pro Gly Pro He Arg Leu Gin Val Thr 
210 215 220 

Leu Glu Asp Ala Ala Ser Ala Ala Ser Ala Ala Ser Ser Ala His Val 
225 230 235 240 

Ala Leu Gin Val His Pro His Cys Thr Val Ala Ala Leu Gin Glu Gin 
245 250 255 

Val Phe Ser Glu Leu Gly Phe Pro Pro Ala Val Gin Arg Trp Val He 
260 265 270 

Gly Arg Cys Leu Cys Val Pro Glu Arg Ser Leu Ala Ser Tyr Gly Val 
275 280 285 

Arg Gin Asp Gly Asp Pro Ala Phe Leu Tyr Leu Leu Ser Ala Pro Arg 
290 295 300 

31u Ala Pro Ala Thr Gly Pro Ser Pro Gin His Pro Gin Lys Met Asp 
305 310 315 320 

Sly Glu Leu Gly Arg Leu Phe Pro Pro Ser Leu Gly Leu Pro Pro Gly 
325 330 335 

Pro Gin Pro Ala Ala Ser Ser Leu Pro Ser Pro Leu Gin Pro Ser Trp 
340 345 350 

Ser Cys Pro Ser Cys Thr Phe He Asn Ala Pro Asp Arg Pro Gly Cys 
355 360 365 

51u Met Cys Ser Thr Gin Arg Pro Cys Thr Trp Asp Pro Leu Ala Ala 
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370 



375 



380 



Ala Ser Thr 
385 



<210> 17 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 17 

gctgaagcag gaaaatcgct t 21 

<210> 18 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 18 

tgagacggag tctcactcgg t 21 

<210> 19 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

c223> Primer 
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<400> 19 

gtttttccta attttggcat gaac 

<210> 20 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
C220> 

C223> Primer 

:400> 20 
:gcccaagct tttcctttt 

:210> 21 

:211> 26 

212> DNA 

213> Artificial Sequence 
220> 

22 3> Primer 

400> 21 

accctttgc ctctgtcact tccgca 

!10> 22 

Jll> 23 

:12> DNA 

:13> Artificial Sequence 
20> 

23> Primer 
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<400> 22 

gctggagcac caggactgca ttg 

<210> 23 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
:400> 23 

rgagctgagc agcagtgtaa tgaa 

:210> 24 

:211> 25 

212> DNA 

213> Artificial Sequence 
220> 

223> Primer 

iOO> 24 

aggcctgcc tgaaggagga gcttc 

!10> 25 

:il> 26 

:12> DNA 

13> Artificial Sequence 
20> 

23> Primer 

00> 25 
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tctggaagta gtgcagacgc ctcagg 

<210> 26 ' 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

:400> 26 

igccaacgtc ggctttgtta tccagc 

:210> 27 

:211> 25 

212> DNA 

213> Artificial Sequence 
220> 

223> Primer 

300> 27 

stgtcagat atgatggttc tggac 

!10> 28 

:il> 24 

:12> DNA 

13> Artificial Sequence 
20> 

23> Primer 

00> 28 

agcctcac cactgttggg ttgc 
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<210> 29 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 29 

cattctctga gctgtattag tgt 23 

<210> 30 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 30 

cctgagctgg aatgacctgc a 21 

<210> 31 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 31 

3tttgtgttg gctgcagcca ca 22 
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<210> 32 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 32 

-tgaggagaga ctttgctgac tggt 24 

<210> 33 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 33 

gtcctgtctg gcggtgccga 20 

<210> 34 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 34 

gctccaggat cccctgtcac ctgggccttc tgccttttgg ct 42 
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<210> 35 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 35 

ccatatggag aggagagcag cgggccca 

<210> 36 

<21X> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

C223> Primer 

:400> 36 

raaggaggaa catggagagg aga 

:210> 37 

:211> 27 

212> DNA 

213> Artificial Sequence 
220> 

223> Primer 

400> 37 

2atatgccc cgggtagtct actgcat 

110> 38 
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<211> 33 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 38 

gtcgactcga gtcacttccg caaaaacttc ttg 

<210> 39 

<211> 22 

C212> DNA 

C213> Artificial Sequence 
C220> 

:223> Primer 

:400> 39 

:ccattccga aggctctcct cc 



210> 40 

211> 23 

212> DNA 

213> Artificial Sequence 
220> 

223> Primer 

iOO> 40 

ictgtgtga cggaaatgta age 

!10> 41 

:il> 23 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 41 

gaaggtcgaa ggcagaccga tgt 

<210> 42 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

C400> 42 

jaatgtgtca gagacaagtg cage 

:210> 43 

:211> 26 

:212> DNA 

:213> Artificial Sequence 
220> 

223> Primer 

400> 43 

gtagaaact cttggactaa tggagg 

210> 44 

211> 29 

U2> DNA 
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<213> 



Artificial Sequence 



<220> 



<223> 



Primer 



<400> 



44 



gtatgcatca gaattcccta tagatcttt 



29 



<210> 45 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 45 

tagatgtttg ggcaacagcc t 21 

<210> 46 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

C400> 46 

itccttcacc aaaggcatcc agccattcta tg 32 

C210> 47 

:211> 28 

:212> DNA 

:213> Artificial Sequence 
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<220> 



<223> 



Primer 



<400> 



47 



aaatgtctga ttaccccatt ttatcagt 



28 



<210> 48 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 48 

taatcctgaa atgaacagct aaca 24 

<210> 49 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 49 

taatgttaga gtaacagcat tttccttcaa 30 

<210> 50 

C211> 29 

<212> DNA 

C213> Artificial Sequence 
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<220> 

<223> Primer 
<400> 50 

tgccccacac taactcagtt cttgtgatg 

<210> 51 
<211> 29 

<212> DNA 

C213> Artificial Sequence 
:220> 

:223> Primer 

:400> 51 

aggctttat caggtctgca tatagaatc 

210> 52 

211> 28 

212> DNA 

213> Artificial Sequence 
220> 

i23> Primer 

:00> 52 

saaagaacc ctaatgctat ttatcagc 

10> 53 

11> 22 

12> DNA 

13> Artificial Sequence 
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Sequence Listing 



<223> Primer 
<400> 53 

gagaagacca gggaagaagc ag 

<210> 54 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 54 
ggacggggaa cttggacgc 

C210> 55 
C211> 20 

C212> DNA 

:213> Artificial Sequence 
:220> 

:223> Primer 

:400> 55 
agtgcaggg cctctgggtg 

210> 56 

211> 19 

212> DNA 

213> Artificial Sequence 
220> 

123> Primer 
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<400> 56 

cgctgggtca tcggacggt 19 
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GENE FAMILIES ASSOCIATED WITH CANCERS 



FIELD OF THE INVENTION 

The present invention relates to the changes in gene expression in human tissues 
5 from cancer patients. The invention specifically relates to human genes which are 
differentially expressed in cancer tissues of breast, colon, esophagus, kidney, liver, lung, 
lymph node, ovary, pancreas, prostate, rectum, and/or stomach compared to corresponding 
normal tissues. 

BACKGROUND OF THE INVENTION 

10 In the United States, more than one million new cancer cases are diagnosed and 

about half million people die of cancer. The causes of cancer are many and varied, and 
include genetic predisposition, environmental influences, infectious agents and ageing. 
These transform normal cells into cancerous ones by derailing a wide spectrum of 
regulatory and downstream effector pathways. Several essential alterations in cell 

15 physiology collectively dictate malignant growth: self-sufficiency in growth signals, 
insensitivity to growth-inhibitory signals, evasion of programmed cell death, limitless 
replicative potential, sustained angiogenesis, and tissue invasion and metastasis (Hanahan 
and Weinberg (2000), Cell 100:57-70). 

To date, researchers have been able to identify many genetic alterations believed to 
20 underlie tumor development. These genetic alterations include amplification of oncogenes 
and mutations that result in the loss of tumor suppressor genes. Oncogenes were initially 
identified as genes carried by viruses that cause transformation of their target cells. A 
major class of the viral oncogenes have cellular counterparts that are involved in normal 
cell functions. The cellular genes are called proto-oncogene, and in certain cases their 
25 mutation or aberrant in the cell is associated with tumor formation. The generation of a 
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oncogene represents a gain-of-function in which a cellular proto-oncogene is 
inappropriately activated. This can involve a mutational change in the protein, or 
constitutive activation, over-expression, or failure to turn off expression at the appropriate 
time. About 100 oncogenes have been identified. Examples of oncogenes include, but are 

5 not limited to, ras, fos, myc, abl, and myb (Ponder (2001), Nature 411:336-341), Tumor 
suppressor genes, in their wild-type alleles, express proteins that suppress abnormal 
cellular proliferation. When the gene coding for a tumor suppressor protein is mutated or 
deleted, the resulting mutant protein or the complete lack of tumor suppressor protein 
expression may fail to correctly regulate cellular proliferation, and abnormal proliferation 

10 may take place, particularly if there is already existing damage to the cellular regulatory 
mechanism. A number of well-studied human tumors and tumor cell lines have missing or 
non-functional tumor suppressor genes. Examples of tumor suppressor genes include, but 
are not limited to, the retinoblastoma susceptibility gene or RB gene, the p53 gene, the 
deletion in colon carcinoma pCC) gene and the neurofibromatosis type 1 (NF-1) tumor 

15 suppressor gene (Weinberg (1991), Science 254:1138-1146). Loss-of-function or 
inactivation of tumor suppressor genes may play a central role in the initiation and/or 
progression of a significant number of human cancers. 

The utilization of genome-wide expression profiles to classify tumors, to identify 
drug targets, to identify diagnostic markers and/or to gain further insights into the 

20 consequences of chemotherapeutic treatments could facilitate the design of more 
efficacious stratagems for treating a variety of cancers. Initial studies utilizing gene 
expression patterns to identify subtypes of cancer produced rather intriguing results (see 
Perou et al (1999), Proc Natl Acad Sci USA 96:9212-9217; Golub et al (1999), Science 
286:531-537; Alizadeh et al (2000), Nature 403:503-511; Alon et al (1999), Proc Natl 

25 Acad Sci USA 96:6745-6750; and Bittner et al. (2000), Nature 406:536-540; Perou et al. 
(2000), Nature 406:747-752). Molecular classification of B-cell lymphoma by gene 
expression profiling elucidated clinically distinct diffuse large-B-cell lymphoma 
subgroups (see Alizadeh et al.y supra). In breast cancer, studies utilizing limited numbers 
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of genes (8,102 genes) have classified tumors into subtypes based on gene expression 
profiles, and this study indicated a diversity of molecular phenotypes associated with 
breast tumors (see Perou et al.y supra). In addition, the expression profiling has enabled 
researchers to map tissue-specific expression levels for thousands of genes (Alon et al 
5 (1999), Proc Natl Acad Sci USA 96:6745-6750; Iyer et al (1999), Science 283:83-87; 
Khan et a/.(1998), Cancer Res 58:5009-5013; Lee et a/.(1999). Science 285:1390-1393; 
Wang et al (1999), Gene 229:101-108; Whitney et al (1999), Ann Neurol 46:425-428). 
Although these studies have demonstrated that expression profiling may be used to 
produce improvements in diagnosis of human diseases such as cancer, as well as in the 
1 0 development of improved therapeutic strategies, further studies are needed. 

Although cancers are diverse and heterogeneous as they are derived from 
numerous tissues and multiple etiologic factors, it has been suggested that underlying this 
variability lies a relatively small number of critical events whose convergence is required 
for the development of any and all cancers (Evan and Vousden (2001), Nature 411:342- 

15 348). Accordingly, there exists a need for the comprehensive investigation of the changes 
in global gene expression levels in many different types of cancers to identify critical 
molecular markers associated with the development and progression of cancer. There 
remains a need in the art for materials and methods that permit a more accurate diagnosis 
of cancer. In addition, there remains a need in the art for methods to treat and methods to 

20 identify agents that can effectively treat this disease. Hie present invention meets these 
and other needs. 

SUMMARY OF THE INVENTION 

The present invention is based on new genes that are differentially expressed in 
cancer tissues compared to normal tissues, hereinafter LFGl, LFG2, LFG3, LFG4, LFG5, 
25 LFG6, respectively. The invention includes isolated nucleic acid molecules comprising 
SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 or the complement thereof. 
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The present invention further includes the nucleic acid molecules operably linked 
to one or more expression control elements, including vectors comprising the isolated 
nucleic acid molecules. The invention further includes host cells transformed to contain 
the nucleic acid molecules of the invention and methods for producing a protein 
S comprising the step of culturing a host cell transformed with a nucleic acid molecule of 
the invention under conditions in which the protein is expressed. 

The invention further provides an isolated polypeptide selected from the group 
consisting of an isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 
2, 4, 6, 8, 10, 12, 14 or 16, an isolated polypeptide comprising a fragment of at least 10 

10 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, an isolated polypeptide 
comprising conservative amino acid substitutions of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16 and an isolated polypeptide comprising naturally occurring amino acid sequence 
variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. Polypeptides of the invention also 
include polypeptides with an amino acid sequence having at least about 50%, 60%, 70% 

IS or 75% amino acid sequence identity with the sequence set forth in SEQ ID NO: 2, 4, 6, 8, 
10, 12, 14 or 16, preferably at least about 80%, more preferably at least about 90-95%, and 
most preferably at least about 95-98% sequence identity with the sequence set forth in 
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 

The present invention further provides methods of identifying other members of 
20 the polypeptide family of the invention. Specifically, the nucleic acid sequence of SEQ ID 
NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 can be used as a probe, or to generate PGR primers, in 
methods to identify nucleic acid molecules that encode other members of the LFGl, LFG2, 
LFG3, LFG4, LFG5 or LFG6 family of proteins. 

The invention further provides an isolated antibody or antigen-binding antibody 
25 fragment that specifically binds to a polypeptide of the invention, including monoclonal 
and polyclonal antibodies. 
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The invention further provides methods of identifying an agent which modulates 
the expression of a nucleic acid molecule encoding a protein of the invention, comprising: 
exposing cells which express the nucleic acid molecule to the agent; and determining 
whether the agent modulates expression of said nucleic acid molecule, thereby identifying 
5 an agent which modulates the expression of a nucleic acid molecule encoding the protein. 

The invention further provides methods of identifying an agent which modulates 
the level of or at least one activity of a protein of the invention, comprising: exposing cells 
which express the protein to the agent; and determining whether the agent modulates the 
level of or at least one activity of said protein, thereby identifying an agent which 
10 modulates the level of or at least one activity of the protein. 

The present invention further provides methods of modulating the expression of a 
nucleic acid molecule encoding a protein of the invention, comprising the step of 
administering an effective amount of an agent which modulates the expression of a nucleic 
acid molecule encoding the protein. The invention also provides methods of modulating 
15 at least one activity of a protein of the invention, comprising the step of administering an 
effective amount of an agent which modulates at least one activity of the protein of the 
invention. 

The invention further provides metfiods of identifying binding partners for a 
protein of tfie invention, comprising the steps of exposing said protein to a potential 
20 binding partner; and determining if the potential binding partner binds to said protein, 
thereby identifying binding partners for the protein. 

The present invention further provides methods to identify agents that can block or 
modulate the association of a protein of the invention with a binding partner. Specifically, 
an agent can be tested for the ability to block, reduce or otherwise modulate the 
25 association of a protein of invention with a binding partner by contacting said protein, or a 
fragment thereof, and a binding partner with a test agent and determining whether the test 
agent blocks or reduces the binding of the protein of invention to the binding partner. 
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The present invention further provides methods for reducing or blocking the 
association of a protein of invention with one or more of its binding partners, comprising 
the step of administrating an effective amount of an agent which reduces or blocks the 
binding of said protein to the binding partner. The method can utilize an agent that binds 
S to the protein of invention or to the binding partner. 

In accordance with another aspect of the invention, the proteins of the invention 
can be used as starting points for rational drug design to provide ligands, therapeutic drugs 
or other types of small chemical molecules. Alternatively, small molecules or other 
compounds identified by the above-described screening assays may serve as "lead 
10 compounds" in rational drug design. 

The present invention further relates to a process for treating cancer comprising 
inserting into a cancerous cell a nucleic acid construct comprising the nucleic acid 
molecules of the invention operably linked to a promoter or enhancer element such that 
expression of said nucleic acid molecule causes suppression of said cancer. 

15 The present invention further includes non-human transgenic animals modified to 

contain the nucleic acid molecules of the invention, or non-human transgenic animals 
modified to contain the mutated nucleic acid molecules such that expression of the 
encoded polypeptides of the invention is prevented. 

The present invention also includes non-human transgenic animals in which all or 
20 a portion of a gene comprising all or a portion of SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 
has been knocked out or deleted from the genome of the animal. 

The invention further provides methods of diagnosing cancers, comprising the 
steps of acquiring a tissue, blood, urine or other sample from a subject and determining the 
level of expression of a nucleic acid molecule of the invention or polypeptide of the 
25 invention. 
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The invention further includes compositions comprising a diluent and a 
polypeptide or protein selected from the group consisting of an isolated polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, an 
isolated polypeptide comprising a fragment of at least 10 amino acids of SEQ ID NO: 2, 4, 

5 6, 8, 10, 12, 14 or 16, an isolated polypeptide comprising conservative amino acid 
substitutions of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, naturally occurring amino acid 
sequence variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 and an isolated polypeptide 
with an amino acid sequence having at least about 50%, 60%, 70% or 75% amino acid 
sequence Identity with the sequence set forth In SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, 

1 0 preferably at least about 80%, more preferably at least about 90-95%, and most preferably 
at least about 95-98% sequence Identity with the sequence set forth In SEQ ID NO: 2, 4, 6, 
8, 10, 12, 14 or 16. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the relative alignment positions of the two LFGl clones. 

15 Figure 2 is a hydrophobicity plot of the protein encoded by the open reading frame 

of LFGl -Clone A (SEQ ID NO: 2). Analysis was p«rformed according to the method of 
Kyte-Doolittle. 

Figure 3 is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFGl -Clone B (SEQ ID NO: 4). Analysis was performed according to the method of 
20 Kyte-Doolittle. 

Figure 4 is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFG2 (SEQ ID NO: 6). Analysis was performed according to the method of Kyte- 
Doolittle. 
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Figure S is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFG3 (SEQ ID NO: 8). Analysis was performed according to the method of Kyte- 
Doolittle. 

Figure 6 is a hydrophobicity plot of the protein encoded by the open reading frame 
5 of LFG4 (SEQ ID NO: 10). Analysis was performed according to the method of Kyte- 
Doolittle. 

Figure 7 is a hydrophobicity plot of the protein encoded by the open reading frame 
of ALFGS (SEQ ID NO: 12). Analysis was performed according to the method of Kyte- 
Doolittle. 

10 Figure 8 shows the relative alignment positions of the two LFG6 clones. 

Figure 9 is a hydrophobicity plot of the protein encoded by the open reading frame 
of LFG6-#20 (SEQ ID NO: 14). Analysis was performed according to the method of 
Kyte-Doolittle. 

Figure 10 is a hydrophobicity plot of the protein encoded by the open reading 
15 frame of LFG6-#46 (SEQ ID NO: 16). Analysis was performed according to the method 
of Kyte-Doolittle. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

/ General Description 

The present invention is based in part on the identification of new gene families 
20 that are difTerentially expressed in cancerous human tissues compared to normal human 
tissues. These gene families correspond to the human cDNA of SEQ ID NOS: 1, 3, S, 7, 9, 
11, 13 and IS. 
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The genes and proteins of the invention may be used as diagnostic agents or 
markers to detect cancer or to differentiate carcinoma from normal tissue in a sample. 
They can also serve as a target for agents that modulate gene expression or activity. For 
example, agents may be identified that modulate biological processes associated with 
5 tumor growth, including the hyperplastic process of cancer. 

//. Specific Embodiments 

A. The Proteins Associated with Cancer 

The present invention provides isolated proteins, allelic variants of the proteins, 
and conservative amino acid substitutions of the proteins. As used herein, the "protein" or 

1 0 "polypeptide" refers, in part, to a protein that has the human amino acid sequence depicted 
in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. The terms also refer to naturally occurring 
allelic variants and proteins that have a slightly different amino acid sequence than that 
specifically recited above. Allelic variants, though possessing a slightly different amino 
acid sequence than those recited above, will still have the same or similar biological 

1 5 functions associated with these proteins. 

As used herein, the &mily of proteins related to the human amino acid sequence of 
SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 refers to proteins that have been isolated from 
organisms in addition to humans. The methods used to identify and isolate other members 
of the family of proteins related to these proteins are described below. 

20 The proteins of the present invention are preferably in isolated form. As used 

herein, a protein is said to be isolated when physical, mechanical or chemical methods are 
employed to remove the protein from cellular constituents that are normally associated 
with the protein. A skilled artisan can readily employ standard purification methods to 
obtain an isolated protein. 
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The proteins of the present invention further include insertion, deletion or 
conservative amino acid substitution variants of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 
As used herein, a conservative variant refers to alterations in the amino acid sequence that 
do not adversely afTect the biological functions of the protein. A substitution, insertion or 

5 deletion is said to adversely affect the protein when the altered sequence prevents or 
disrupts a biological function associated with the protein. For example, the overall charge, 
structure or hydrophobic/hydrophilic properties of the protein, in certain instances, may be 
altered without adversely affecting a biological activity. Accordingly, the amino acid 
sequence can be altered, for example to render the peptide more hydrophobic or 

1 0 hydrophilic, without adversely affecting the biological activities of the protein. 

Ordinarily, the allelic variants, the conservative substitution variants, and the 
members of the protein family, will have an amino acid sequence having at least about 
50%, 60%, 70% or 75% amino acid sequence identity with the sequence set forth in SEQ 
ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, more preferably at least about 80%, even more 

15 preferably at least about 90-95%, and most preferably at least about 95-98% sequence 
identity. Identity or homology with respect to such sequences is defined herein as the 
percentage of amino acid residues in the candidate sequence that are identical with SEQ 
ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent homology, and not considering any 

20 conservative substitutions as part of the sequence identity (see section B for the relevant 
parameters). Fusion proteins, or N-terminal, C-terminal or internal extensions, deletions, 
or insertions into the peptide sequence shall not be construed as affecting homology. 

Thus, the proteins of the present invention include molecules having the amino 
acid sequence disclosed in SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16; fragments thereof 
25 having a consecutive sequence of at least about 3, 4, 5, 6, 10, 15, 20, 25, 30, 35 or more 
amino acid residues of these proteins; amino acid sequence variants wherein one or more 
amino acid residues has been inserted N- or C-terminal to, or within, the disclosed coding 
sequence; and amino acid sequence variants of the disclosed sequence, or their fragments 

10 
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as defined above, that have been substituted by at least one residue. Such fragments, also 
referred to as peptides or polypeptides, may contain antigenic regions, functional regions 
of the protein identified as regions of the amino acid sequence which correspond to known 
protein domains, as well as regions of pronounced hydrophilicity. The regions are all 
5 easily identifiable by using commonly available protein sequence analysis software such 
as MacVector (Oxford Molecular). 

Contemplated variants further include those containing predetermined mutations 
by, e.g., homologous recombination, site-directed or PGR mutagenesis, and the 
corresponding proteins of other animal species, including but not limited to rabbit, mouse, 
10 rat, porcine, bovine, ovine, equine and non-human primate species, and the alleles or other 
naturally occurring variants of the family of proteins; and derivatives wherein the protein 
has been covalently modified by substitution, chemical, enzymatic, or other appropriate 
means with a moiety other than a naturally occurring amino acid (for example a detectable 
moiety such sis an enzyme or radioisotope). 

15 The present invention further provides compositions comprising a protein or 

polypeptide of the invention and a diluent. Suitable diluents can be aqueous or non- 
aqueous solvents or a combination thereof, and can comprise additional components, for 
example water-soluble salts or glycerol, that contribute to the stability, solubility, activity, 
and/or storage of the protein or polypeptide. 

20 As described below, members of the families of proteins can be used: (1) to 

identify agents which modulate the level of or at least one activity of the protein, (2) to 
identify binding partners for the protein, (3) as an antigen to raise polyclonal or 
monoclonal antibodies, (4) as a therapeutic agent or target and (5) as a diagnostic agent or 
marker of cancer. 

25 B. Nucleic Acid Molecules 
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The present invention further provides nucleic acid molecules that encode the 
protein having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 and the related proteins herein 
described, preferably in isolated form. As used herein, "nucleic acid" is defined as RNA 
or DNA that encodes a protein or peptide as defined above, is complementary to a nucleic 

5 acid sequence encoding such peptides, hybridizes to the nucleic acid of SEQ ID NO: 1, 3, 
5, 7, 9, 1 1, 13 or 15 and remains stably bound to it under appropriate stringency conditions, 
encodes a polypeptide sharing at least about 50%, 60%, 70% or 75%, preferably at least 
about 80%, more preferably at least about 90-95%, and most preferably at least about 95- 
98% or more identity with the peptide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 

10 or exhibits at least 50%, 60%, 70% or 75%, preferably at least about 80%, more preferably 
at least about 90-95%, and most preferably at least about 95-98% or more nucleotide 
sequence identity over the open reading frames of SEQ ID NO: 1 , 3, 5, 7, 9, 1 1 , 1 3 or 1 5. 

The present invention further includes isolated nucleic acid molecules that 
specifically hybridize to the complement of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, 
15 particularly molecules that specifically hybridize over the open reading frames. Such 
molecules tiiat specifically hybridize to the complement of SEQ ID NO: I, 3, 5, 7, 9, 1 1, 
13 or 15 typically do so under stringent hybridization conditions. 

Specifically contemplated are genomic DNA, cDNA, mRNA and antisense 
molecules, as well as nucleic acids based on alternative backbones or including alternative 
20 bases, whether derived from natural sources or synthesized. Such hybridizing or 
complementary nucleic acids, however, are defined fiirther as being novel and unobvious 
over any prior art nucleic acid including that which encodes, hybridizes under appropriate 
stringency conditions, or is complementary to nucleic acid encoding a protein according to 
the present invention. 

25 Homology or identity at the nucleotide or amino acid sequence level is determined 

by BLAST (Basic Local Alignment Search Tool) analysis using the algorithm employed 
by the programs blastp, blastn, blastx, tblastn and tblastx (Altschul et al. (1997), Nucleic 
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Acids Res, 25: 3389-3402, and Karlin et al (1990), Proc, Natl Acad, ScL USA 87: 2264- 
2268, both fully incorporated by reference) which are tailored for sequence similarity 
searching. The approach used by the BLAST program is to first consider similar segments, 
with and without gaps, between a query sequence and a database sequence, then to 
5 evaluate the statistical significance of all matches that are identified and finally to 
summarize only those matches which satisfy a preselected threshold of significance. For a 
discussion of basic issues in similarity searching of sequence databases, see Altschul et aL 
(1994), Nat. Genet. 6: 119-129 which is fully incorporated by reference. The search 
parameters for histogram, descriptions, alignments, expect (i.e., the statistical significance 
10 threshold for reporting matches against database sequences), cutoff, matrix and filter (low 
complexity) are at the default settings. The default scoring matrix used by blastp, blastx, 
tblastn, and tblastx is the BLOSUM62 matrix (Henikoflf e/ al (1992), Proc. Natl Acad. 
Set USA 89: 10915-10919, fiilly incorporated by reference), recommended for query 
sequences over 8S nucleotides or amino acids in length. 

IS For blastn, the scoring matrix is set by the ratios of M (i.e, the reward score for a 

pair of matching residues) to N (i.e., the penalty score for mismatching residues), wherein 
the default values for M and N are S and -4, respectively. Four blastn parameters were 
adjusted as follows: Q=10 (gap creation penalty); R=10 (gap extension penalty); wink=l 
(generates word hits at every wink^ position along the query); and gapw=16 (sets the 

20 window width within which gapped alignments are generated). The equivalent Blastp 
parameter settings were (5=9; R=2; wink=l; and gapw=32. A Bestfit comparison between 
sequences, available in the GCG package version 10.0, uses DNA parameters GAP=50 
(gap creation penalty) and LEN=3 (gap extension penalty) and the equivalent settings in 
protein comparisons are GAP=8 and LEN=2. 

2S ''Stringent conditions*' are those that (1) employ low ionic strength and high 

temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium citrate/0.1% SDS 
at SO^C, or (2) employ during hybridization a denaturing agent such as formamide, for 
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example, 50% (vol/vol) formamide with 0.1% bovine serum aIbumin/0.1% Ficoll/0.1% 
poIyvinylpyiTolidone/50 mM sodium phosphate buffer at pH 6,5 with 750 mM NaCl, 75 
mM sodium citrate at Another example is hybridization in 50% formamide, 5x 

SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% 
5 sodium pyrophosphate, 5x Denhardt's solution, sonicated salmon sperm DNA (50 ^g/ml), 
0.1% SDS, and 10% dextran sulfate at 42**C, with washes at 42^C in 0.2x SSC and 0.1% 
SDS. A skilled artisan can readily determine and vary the stringency conditions 
appropriately to obtain a clear and detectable hybridization signal. Preferred molecules 
are those that hybridize under the above conditions to the complement of SEQ ID NO: 1, 3, 
10 5, 7, 9, 11, 13 or 15 and which encode a functional or full-length protein. Even more 
preferred hybridizing molecules are those that hybridize under the above conditions to the 
complement strand of the open reading frame of SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15. 

As used herein, a nucleic acid molecule is said to be "isolated'* when the nucleic 
acid molecule is substantially separated from contaminant nucleic acid molecules 
1 5 encoding other polypeptides. 

The present invention further provides fragments of the disclosed nucleic acid 
molecules. As used herein, a fragment of a nucleic acid molecule refers to a small portion 
of the coding or non-coding sequence. The size of the fragment will be determined by the 
intended use. For example, if the fragment is chosen so as to encode an active portion of 
20 the protein, the fragment will need to be large enough to encode the functional region(s) of 
the protein. For instance, fragments which encode peptides corresponding to predicted 
antigenic regions may be prepared. If the fragment is to be used as a nucleic acid probe or 
PCR primer, then the fragment length is chosen so as to obtain a relatively small number 
of false positives during probing/priming (see the discussion in Section G). 

25 Fragments of the nucleic acid molecules of the present invention (/.e., synthetic 

oligonucleotides) that are used as probes or specific primers for the polymerase chain 
reaction (PCR), or to synthesize gene sequences encoding proteins of the invention, can 
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easily be synthesized by chemical techniques, for example, the phosphoramidite method 
of Matteucci et al, ((1981) J. Am. Chem, Soc. 103: 3185-3191) or using automated 
synthesis methods. In addition, larger DNA segments can readily be prepared by well 
known methods, such as synthesis of a group of oligonucleotides that define various 
5 modular segments of the gene, followed by ligation of oligonucleotides to build the 
complete modified gene. 

The nucleic acid molecules of the present invention may further be modified so as 
to contain a detectable label for diagnostic and probe purposes. A variety of such labels 
are known in the art and can readily be employed with the encoding molecules herein 
10 described. Suitable labels include, but are not limited to, biotin, radiolabeled or 
fluorescently labeled nucleotides and the like. A skilled artisan can readily employ any 
such label to obtain labeled variants of the nucleic acid molecules of the invention. 

C. Isolation of Other Related Nucleic Acid Molecules 

As described above, the identification and characterization of the nucleic acid 
15 molecule having SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 allows a skilled artisan to isolate 
nucleic acid molecules that encode other members of the protein family in addition to the 
sequences herein described. Further, the presently disclosed nucleic acid molecules allow 
a skilled artisan to isolate nucleic acid molecules that encode other members of the family 
of proteins in addition to the proteins having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 

20 For instance, a skilled artisan can readily use the amino acid sequence of SEQ ID 

NO: 2, 4, 6, 8, 10, 12, 14 or 16 to generate antibody probes to screen expression libraries 
prepared from appropriate cells. Typically, polyclonal antiserum from mammals such as 
rabbits immunized with the purified protein (as described below) or monoclonal 
antibodies can be used to probe a mammalian cDNA or genomic expression library, such 

25 as lambda gtU library, to obtain the appropriate coding sequence for other members of the 
protein family. The cloned cDNA sequence can be expressed as a fusion protein, 



15 



wo 2004/035789 



IPCT/KR2003/002161 



expressed directly using its own control sequences, or expressed by constructions using 
control sequences appropriate to the particular host used for expression of the enzyme. 

Alternatively, a portion of the coding sequence herein described can be 
synthesized and used as a probe to retrieve DNA encoding a member of the protein family 
5 from any mammalian organism. Oligomers containing approximately 18-20 nucleotides 
(encoding about a 6-7 amino acid stretch) are prepared and used to screen genomic DNA 
or cDNA libraries to obtain hybridization under stringent conditions or conditions of 
sufficient stringency to eliminate an undue level of false positives. 

Additionally, pairs of oligonucleotide primers can be prepared for use in PCR to 
10 selectively clone an encoding nucleic acid molecule. A PCR denature/anneal/extend cycle 
for using such PCR primers is well known in the art and can readily be adapted for use in 
isolating other encoding nucleic acid molecules. 

Nucleic acid molecules encoding other members of the protein family may also be 
identified in existing genomic or other sequence information using any available 
15 computational method, including but not limited to: PSI-BLAST (Altschul et al (1997), 
Nucl Acids Res. 25: 3389-3402); PHI-BLAST (Zhang et al (1998), Nucl Acids Res. 26: 
3986-3990), 3D-PSSM (Kelly et al (2000), J. Mol Biol 299: 499-520); and other 
computational analysis methods (Shi et al (1999), Biochem. Biophys. Res. Commun. 262: 
132-138 and Matsunami et. al (2000), Nature 404: 601-604). 

20 D. rDNA molecules Containing a Nucleic Acid Molecule 

The present invention further provides recombinant DNA molecules (rDNAs) that 
contain a coding sequence. As used herein, a rDNA molecule is a DNA molecule that has 
been subjected to molecular manipulation in situ. Methods for generating rDNA 
molecules are well known in the art, for example, see Sambrook et al.^ Molecular 
25 Cloning- A Laboratorv ManuaL Third Ed.. Cold Spring Harbor Laboratory Press, Cold 
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Spring Harbor, NY, 2001. In the preferred rDNA molecules, a coding DNA sequence is 
operably linked to expression control sequences and/or vector sequences. 

The choice of vector and/or expression control sequences to which one of the 
protein family encoding sequences of the present invention is operably linked depends 
5 directly, as is well known in the art, on the functional properties desired, e.g., protein 
expression, and the host cell to be transformed. A vector contemplated by the present 
invention is at least capable of directing the replication or insertion into the host 
chromosome, and preferably also expression, of the structural gene included in the rDNA 
molecule. 

10 Expression control elements that are used for regulating the expression of an 

operably linked protein encoding sequence are known in the art and include, but are not 
limited to, inducible promoters, constitutive promoters, secretion signals, and other 
regulatory elements. Preferably, the inducible promoter is readily controlled, such as 
being responsive to a nutrient in the host cell's medium. 

15 In one embodiment, the vector containing a coding nucleic acid molecule will 

include a prokaryotic replicon, i.e., a DNA sequence having the ability to direct 
autonomous replication and maintenance of the recombinant DNA molecule 
extrachromosomally in a prokaryotic host cell, such as a bacterial host cell, transformed 
therewith. Such replicons are well known in the art. In addition, vectors that include a 

20 prokaryotic replicon may also include a gene whose expression confers a detectable 
marker such as a drug resistance. Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin, kanamycin, chloramphenicol or tetracycline. 

Vectors that include a prokaryotic replicon can further include a prokaryotic or 
bacteriophage promoter capable of directing the expression (transcription and translation) 
25 of the coding gene sequences in a bacterial host cell, such as E. coli. A promoter is an 
expression control element formed by a DNA sequence that permits binding of RNA 
polymerase and transcription to occur. Promoter sequences compatible with bacterial 
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hosts are typically provided in plasmid vectors containing convenient restriction sites for 
insertion of a DNA segment of the present invention. Typical of such vector plasmids are 
pUC8, pUC9, pBR322 and pBR329 available from BioRad Laboratories, (Richmond, CA), 
pPL and pKK223 available from Pharmacia (Piscataway, NJ). 

5 Expression vectors compatible with eukaryotic cells, preferably those compatible 

with vertebrate cells, can also be used to form rDNA molecules that contain a coding 
sequence, Eukaryotic cell expression vectors, including viral vectors, are well known in 
the art and are available from several commercial sources. Typically, such vectors are 
provided containing convenient restriction sites for insertion of the desired DNA segment. 
10 Typical of such vectors are pSVL and pKSV-10 (Pharmacia), pBPV-l/pML2d 
(International Biotechnologies, Inc.), pTDTl (ATCC, #31255), the vector pCDM8 
described herein, and the like eukaryotic expression vectors. Vectors may be modified to 
include tissue specific promoters if needed. 

Eukaryotic cell expression vectors used to construct the rDNA molecules of the present 
invention may further include a selectable marker that is effective in an eukaryotic cell, 
preferably a drug resistance selection marker. A preferred drug resistance marker is the 
gene whose expression results in neomycin resistance, /.e., the neomycin 
phosphotransferase (neo) gene. (Southern et aL (1982),,/ MoL Anal. Genet. 1:327-341). 
Alternatively, the selectable marker can be present on a separate plasmid, and the two 
vectors are introduced by co-transfection of the host cell, and selected by culturing in the 
appropriate drug for the selectable marker. 

E. Host Cells Containing an Exogenously Supplied Coding Nucleic Acid Molecule 

The present invention further provides host cells transformed with a nucleic acid 
molecule that encodes a protein of the present invention. The host cell can be either 
25 prokaryotic or eukaryotic. Eukaryotic cells useful for expression of a protein of the 
invention are not limited, so long as the cell line is compatible with cell culture methods 
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and compatible with the propagation of the expression vector and expression of the gene 
product. Preferred eukaryotic host cells include, but are not limited to, yeast, insect and 
mammalian cells, preferably vertebrate cells such as those from a mouse, rat, monkey or 
human cell line. Preferred eukaryotic host cells include Chinese hamster ovary (CHO) 
5 cells available from the ATCC as CCL61, NIH Swiss mouse embryo cells (NIH/3T3) 
available from the ATCC as CRL 1658. baby hamster kidney cells (BHK), and the like 
eukaryotic tissue culture cell lines. 

Any prokaryotic host can be used to express a rDNA molecule encoding a protein 
of the invention. The preferred prokaryotic host is E. coli. 
10 Transformation of appropriate cell hosts with a rDNA molecule of the present 

invention is accomplished by well known methods that typically depend on the type of 
vector used and host system employed. With regard to transformation of prokaryotic host 
cells, electroporation and salt treatment methods are typically employed (see. for example, 
Cohen et al. (1972), Proc. Natl. Acad. Sci. USA 69: 2110; and Sambrook et al, supra). 
15 With regard to transformation of vertebrate cells with vectors containing rDNAs, 
electroporation, cationic lipid or salt treatment methods are typically employed, see, for 
example, Graham et al. (1973), Virol. 52: 456; Wigler et al. (1979), Proc. Natl Acad. Sci. 
USA 76: 1373-1376. 

Successfully transformed cells, i.e., cells that contain a rDNA molecule of the 
20 present invention, can be Identified by well known techniques including the selection for a 
selectable marker. For example, cells resulting from the introduction of an rDNA of the 
present invention can be cloned to produce single colonies. Cells from those colonies can 
be harvested, lysed and their DNA content examined for the presence of the rDNA using a 
method such as that described by Southern. (1975) J. Mol. Biol. 98: 503 or Berent et al., 
25 (1985) Biotech. 3: 208, or the proteins produced from the cell assayed via an 
immunological method. 
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F. Production of Recombinant Proteins using a rDNA Molecule 

The present invention further provides methods for producing a protein of the 
invention using nucleic acid molecules herein described. In general terms, the production 
of a recombinant form of a protein typically involves the following steps: 

5 First, a nucleic acid molecule is obtained that encodes a protein of the invention, 

such as a nucleic acid molecule comprising, consisting essentially of or consisting of SEQ 
ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, or nucleotides 390-4883 or 390-4880 of SEQ ID NO: 1, 
or nucleotides 12-4907 or 12-4904 of SEQ ID NO: 3, or nucleotides 424-1911 or 424- 
1908 of SEQ ID NO: 5, or nucleotides 405-1838 or 405-1835 of SEQ ID NO: 7, or 

10 nucleotides 89-1153 or 89-1150 of SEQ ID NO: 9, or nucleotides 223-1572 or 223-1569 
of SEQ ID NO: 1 1, or 418-1395 or 418-1392 of SEQ ID NO: 13, or nucleotides 271-1434 
or 271-1431 of SEQ ID NO: 15. If the encoding sequence is uninterrupted by introns, as 
are these open-reading-frames, it is directly suitable for expression in any host. 

The nucleic acid molecule is then preferably placed in operable linkage with 
15 suitable control sequences, as described above, to form an expression unit containing the 
protein open reading frame. The expression unit is used to transform a suitable host and 
the transformed host is cultured under conditions that allow the production of the 
recombinant protein. Optionally the recombinant protein is isolated from the medium or 
ftom the cells; recovery and purification of the protein may not be necessary in some 
20 instances where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of ways. For example, the 
desired coding sequences may be obtained from genomic fragments and used directly in 
appropriate hosts. The construction of expression vectors that are operable in a variety of 
hosts is accomplished using appropriate replicons and control sequences, as set forth 
25 above. The control sequences, expression vectors, and transformation methods are 
dependent on the type of host cell used to express the gene and were discussed in detail 
earlier. Suitable restriction sites can, if not normally available, be added to the ends of the 
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coding sequence so as to provide an excisable gene to insert into these vectors. A skilled 
artisan can readily adapt any host/expression system known in the art for use with the 
nucleic acid molecules of the invention to produce recombinant protein. 

G. Methods to Identify Agents that Modulate the Expression of a Nucleic Acid 
S Encoding the Genes Associated with Cancer 

Another embodiment of the present invention provides methods for identifying 
agents that modulate the expression of a nucleic acid encoding a protein of the invention 
such as a protein having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16. Such assays may utilize any available means of monitoring for changes in the 
10 expression level of the nucleic acids of the invention. As used herein, an agent is said to 
modulate the expression of a nucleic acid of the invention if it is capable of up- or down- 
regulating expression of the nucleic acid in a cell. 

In one assay format, cell lines that contain reporter gene fusions between 
nucleotides from within the open reading frame defined by nucleotides 390-4883 of SEQ 

15 ID NO: 1 , nucleotides 12-4907 of SEQ ID NO: 3, nucleotides 424-191 1 of SEQ ID NO: 5, 
nucleotides 405-1838 of SEQ ID NO: 7, nucleotides 89-1153 of SEQ ID NO: 9, 
nucleotides 223-1572 of SEQ ID NO: 11, nucleotides 418-1395 of SEQ ID NO: 13, 
nucleotides 271-1434 of SEQ ID NO: 15, and/or the 5* and/or 3' regulatory elements and 
any assayable fusion partner may be prepared. Numerous assayable fusion partners are 

20 known and readily available including the firefly luciferase gene and the gene encoding 
chloramphenicol acetyltransferase (Alam et al (1990), Anal Biochem. 188: 245-254). 
Cell lines containing the reporter gene fusions are then exposed to the agent to be tested 
under appropriate conditions and time. Differential expression of the reporter gene 
between samples exposed to the agent and control samples identifies agents which 

25 modulate the expression of a nucleic acid of the invention. 
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. Additional assay formats may be used to monitor the ability of the agent to 
modulate the expression of a nucleic acid encoding a protein of the invention, such as the 
protein having SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. For instance, mRNA expression 
may be monitored directly by hybridization to the nucleic acids of the invention. Cell 
5 lines are exposed to the agent to be tested under appropriate conditions and tinie and total 
RNA or mRNA is isolated by standard procedures such those disclosed in Sambrook et al. 
Molecular Cloning - A Laboratory Manual. Third Ed.. Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY, 2001. 

The preferred cells will be those derived from human tissue, for instance, biopsy 
10 tissue or cultured cells from patients with cancer. Cell lines such as ATCC breast ductal 
carcinoma cell lines (Catalogue Nos. CRL-2320, CRL-2338, and CRL-7345), ATCC 
colorectal adenocarcinoma cell lines (Catalogue Nos. CCL-222, CCL-224, CCL-225, 
CCL-234, CRL-7159, and CRL-7184), ATCC kidney clear cell carcinoma cell lines 
(Catalogue Nos. HTB-46 and HTB-47), ATCC renal cell adenocarcinoma cell lines 
15 (Catalogue Nos. CRL-1611, CRL-1932 and CRL-1933), ATCC liver hepatocellular 
carcinoma cell lines (Catalogue Nos. CRL-2233, CRL-2234, and HB-8065), ATCC lung 
adenocarcinoma cell lines (Catalogue Nos. CRL-5944, CRL-7380, and CRL-5907), 
ATCC lymphoma cell lines (Catalogue Nos. CRL-7936, CRL-7264, and CRL-7507), 
ATCC ovary adenocarcinoma cell lines (Catalogue Nos. HTB-161, HTB-75, and HTB-76), 
20 ATCC pancreas adenocarcinoma cell lines (Catalogue Nos. CRL-1687, CRL-2119, and 
HTP-79), prostate adenocarinoma cell lines (Catalogue Nos. CRL-1435, CRL-2422, and 
CRL-2220), and ATCC gastric adenocarcinoma cell lines (Catalogue Nos. CRL-1739, 
CRL-1863, and CRL-1864) may be used. Alternatively, other available cells or cell lines 
may be used. 

25 Probes to detect difiTerences in RNA expression levels between cells exposed to the 

agent and control cells may be prepared from the nucleic acids of the invention. It is 
preferable, but not necessary, to design probes which hybridize only with target nucleic 
acids under conditions of high stringency. Only highly complementary nucleic acid 
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hybrids form under conditions of high stringency. Accordingly, the stringency of the 
assay conditions determines the amount of complementarity which should exist between 
two nucleic acid strands in order to form a hybrid. Stringency should be chosen to 
maximize the difference in stability between the probe:target hybrid and probernon-target 
5 hybrids. 

Probes may be designed from the nucleic acids of the invention through methods 
known in the art. For instance, the G+C content of the probe and the probe length can 
affect probe binding to its target sequence. Methods to optimize probe specificity are 
commonly available in Sambrook et ai, supra, or Ausubel et aL, Short Protocols in 
10 Molecular Bioloev. Fourth Ed„ John Wiley & Sons, Inc., New York, 1999. 

Hybridization conditions are modified using known methods, such as those 
described by Sambrook et al and Ausubel et al as required for each probe. Hybridization 
of total cellular RNA or RNA enriched for polyA RNA can be accomplished in any 
available format. For instance, total cellular RNA or RNA enriched for polyA RNA can 

IS be affixed to a solid support and the solid support exposed to at least one probe 
comprising at least one, or part of one of the sequences of the invention under conditions 
in which the probe will specifically hybridize. Alternatively, nucleic acid fragments 
comprising at least one, or part of one of the sequences of the invention can be affixed to a 
solid support, such as a silicon chip, porous glass wafer or membrane. The solid support 

20 can then be exposed to total cellular RNA or polyA RNA from a sample under conditions 
in which the afRxed sequences will specifically hybridize. Such solid supports and 
hybridization methods are widely available, for example, those disclosed by Beattie, 
(1995) WO 95/11755. By examining for the ability of a given probe to specifically 
hybridize to an RNA sample from an untreated cell population and from a cell population 

25 exposed to the agent, agents which up- or down-regulate the expression of a nucleic acid 
encoding the protein having the sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16 are 
identified. 
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Hybridization for qualitative and quantitative analysis of mRNAs may also be 
carried out by using a RNase Protection Assay (i.e., RPA, see Ma et al. (1996), Methods 
10: 273-238). Briefly, an expression vehicle comprising cDNA encoding the gene product 
and a phage specific DNA dependent RNA polymerase promoter {e.g., T7, T3 or SP6 

5 RNA polymerase) is linearized at the 3' end of the cDNA molecule, downstream from the 
phage promoter, wherein such a linearized molecule is subsequently used as a template for 
synthesis of a labeled antisense transcript of the cDNA by in vitro transcription. The 
labeled transcript is then hybridized to a mixture of isolated RNA (i.e., total or 
fractionated mRNA) by incubation at 45°C overnight in a buffer comprising 80% 

10 formamide, 40 mM Pipes, pH 6.4, 0.4 M NaCl and 1 mM EDTA. The resulting hybrids 
are then digested in a buffer comprising 40 ^g/ml ribonuclease A and 2 ^g/ml 
ribonuclease. After deactivation and extraction of extraneous proteins, the samples are 
loaded onto urea/polyacrylamide gels for analysis. 

In another assay, to identify agents which affect the expression of the instant gene 
15 products, cells or cell lines are first identifled which express the gene products of the 
invention physiologically. Cells and/or cell lines so identified would be expected to 
comprise the necessary cellular machinery such that the fidelity of modulation of the 
transcriptional apparatus is mdntained with regard to exogenous contact of agent with 
Impropriate surface transduction mechanisms and/or the cytosolic cascades. Further, such 
20 cells or cell lines would be transduced or transfected with an expression vehicle (e.g., a 
plasmid or viral vector) construct comprising an operable non-translated 5' promoter- 
containing end of the structural gene encoding the instant gene products fused to one or 
more antigenic fragments, which are peculiar to the instant gene products, wherein said 
fragments are under the transcriptional control of said promoter and are expressed as 
25 polypeptides whose molecular weight can be distinguished from the naturally occurring 
polypeptides or may fiirther comprise an immunologically distinct tag or other detectable 
marker. Such a process is well known in the art (see Sambrook et al., supra). 
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Cells or cell lines transduced or transfected as outlined above are then contacted 
with agents under appropriate conditions. For example, the agent in a pharmaceutically 
acceptable excipient is contacted with cells in an aqueous physiological buffer such as 
phosphate buffered saline (PBS) at physiological pH, Eagles balanced salt solution (BSS) 

5 at physiological pH, PBS or BSS comprising serum or conditioned media comprising PBS 
or BSS and/or serum incubated at 37''C. Said conditions may be modulated as deemed 
necessary by one of skill in the art. Subsequent to contacting the cells with the agent, said 
cells will be disrupted and the polypeptides of the lysate are fractionated such that a 
polypeptide fraction is pooled and contacted with an antibody to be further processed by 

10 immunological assay (e.g., ELISA, immunoprecipitation or Western blot). The pool of 
proteins isolated from the "agent-contacted" sample will be compared with a control 
sample where only the excipient is contacted with the cells and an increase or decrease in 
the immunologically generated signal from the "agent-contacted" sample compared to the 
control will be used to distinguish the effectiveness of the agent. 

IS H. Methods to Identify Agents that Modulate the Level or at Least One Activity of 
the Cancer Associated Proteins 

Another embodiment of the present invention provides methods for identifying 
agents that modulate the level or at least one activity of a protein of the invention such as 
the protein having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16. 
20 Such methods or assays may utilize any means of monitoring or detecting the desired 
activity and are particularly useful for identifying agents that treat cancer. 

In one format, the relative amounts of a protein of the invention between a cell 
population that has been exposed to the agent to be tested compared to an un-exposed 
control cell population may be assayed. In this format, probes such as specific antibodies 
25 are used to monitor the differential expression of the protein in the different cell 
populations. Cell lines or populations are exposed to the agent to be tested under 
appropriate conditions and time. Cellular lysates may be prepared from the exposed cell 
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line or population and a control, unexposed cell line or population. The cellular lysates 
are then analyzed with the probe. 

Antibody probes are prepared by immunizing suitable mammalian hosts in 
appropriate immunization protocols using the peptides, polypeptides or proteins of the 

5 invention if they are of sufficient length, or, if desired, or if required to enhance 
immunogenicity, conjugated to suitable carriers. Methods for preparing immunogenic 
conjugates with carriers such as BSA, KLH, or other carrier proteins are well known in the 
art. In some circumstances, direct conjugation using, for example, carbodiimide reagents 
may be effective; in other instances linking reagents such as those supplied by Pierce 

10 Chemical Co. (Rockford, IL), may be desirable to provide accessibility to the hapten. The 
hapten peptides can be extended at either the amino or carboxy terminus with a cysteine 
residue or interspersed with cysteine residues, for example, to facilitate linking to a carrier. 
Administration of the immunogens is conducted generally by injection over a suitable 
time period and with use of suitable adjuvants, as is generally understood in the art. 

15 During the immunization schedule, titers of antibodies are taken to determine adequacy of 
antibody formation. 

While the polyclonal antisera produced in this way may be satisfactory for some 
applications, for pharmaceutical compositions, use of monoclonal preparations is preferred. 
Immortalized cell lines which secrete the desired monoclonal antibodies may be prepared 

20 using the standard method of Kohler and Milstein ((1975) Nature 256: 495-497) or 
modifications which effect immortalization of lymphocytes or spleen cells, as is generally 
knovm. The immortalized cell lines secreting the desired antibodies are screened by 
immunoassay in which the antigen is the peptide hapten, polypeptide or protein. When 
the appropriate immortalized cell culture secreting the desired antibody is identified, the 

25 cells can be cultured either in vitro or by production in ascites fluid. 

The desired monoclonal antibodies are then recovered from the culture supernatant 
or from the ascites supernatant. Fragments of the monoclonal antibodies or the polyclonal 
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antisera which contain the immunologically significant (antigen-binding) portion can be 
used as antagonists, as well as the intact antibodies. Use of immunologically reactive 
(antigen-binding) antibody fragments, such as the Fab, Fab', or F(ab')2 fragments is often 
preferable, especially in a therapeutic context, as these fragments are generally less 
S immunogenic than the whole immunoglobulin. 

The antibodies or antigen-binding fragments may also be produced, using current 
technology, by recombinant means. Antibody regions that bind specifically to the desired 
regions of the protein can also be produced in the context of chimeras with multiple 
species origin, such as humanized antibodies. 

1 0 Agents that are assayed in the above method can. be randomly selected or rationally 

selected or designed. As used herein, an agent is said to be randomly selected when the 
agent is chosen randomly without considering the specific sequences involved in the 
association of a protein of the invention alone or with its associated substrates, binding 
partners, etc. An example of randomly selected agents is the use a chemical library or a 

1 5 peptide combinatorial library, or a growth broth of an organism. 

As used herein, an agent is said to be rationally selected or designed when the 
agent is chosen on a nonrandom basis which takes into account the sequence of the target 
site and/or its conformation in connection with the agent's action. Agents can be 
rationally selected or rationally designed by utilizing the peptide sequences that make up 
20 these sites. For example, a rationally selected peptide agent can be a peptide whose amino 
acid sequence is identical to or a derivative of any functional consensus site. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. Dominant negative proteins, DNAs 
encoding these proteins, antibodies to these proteins, peptide fragments of these proteins 
25 or mimics of these proteins may be introduced into cells to affect fiinction. "Mimic" used 
herein refers to the modification of a region or several regions of a peptide molecule to 
provide a structure chemically different from the parent peptide but topographically and 
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functionally similar to the parent peptide (see Grant in: Molecular Biology and 
Biotechnology, Meyers, ed.. pp. 659-664, VCH Publishers, Inc., New York, 1995). A 
skilled artisan can readily recognize that there is no limit as to the structural nature of the 
agents of the present invention. 

S The peptide agents of the invention can be prepared using standard solid phase (or 

solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantly using standard recombinant 
production systems. The production using solid phase peptide synthesis is necessitated if 
10 non-gene-encoded amino acids are to be included. 

Another class of agents of the present invention are antibodies immunoreactive 
with critical positions of proteins of the invention, e.g., cytoplasmic domain, spacer 
domain, a-helical coiled-coil domain, or the receptor domain, as described herein. 
Antibody agents are obtained by immunization of suitable mammalian subjects with 
IS peptides, containing as antigenic regions, those portions of the protein intended to be 
targeted by the antibodies. 

I. Uses for Agents that Modulate the Expression or at Least one Activity of the 
Proteins Associated with Cancer 

As provided in the Examples, the proteins and nucleic acids of the invention, such 
20 as the proteins having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, 
are differentially expressed in cancerous tissue. Agents that up- or down- regulate or 
modulate the expression of the protein or at least one activity of the protein, such as 
agonists or antagonists, may be used to modulate biological and pathologic processes 
associated with the protein's function and activity. This includes agents identified 
25 employing homologues and analogues of the present invention. 
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As used herein, a subject can be any mammal, so long as the mammal is in need of 
modulation of a pathological or biological process mediated by a protein of the invention. 
The term "mammaP' is defined as an individual belonging to the class Mammalia. The 
invention is particularly useful in the treatment of human subjects. 

S Pathological processes refer to a category of biological processes which produce a 

deleterious effect. For example, expression of a protein of the invention may be 
associated with cell growth or hyperplasia. As used herein, an agent is said to modulate a 
pathological process when the agent reduces the degree or severity of the process. For 
instance, cancer may be prevented or disease progression modulated by the administration 
10 of agents which up- or down-regulate or modulate in some way the expression or at least 
one activity of a protein of the invention. 

The agents of the present invention can be provided alone, or in combination with 
other agents that modulate a particular pathological process. For example, an agent of the 
present invention can be administered in combination with other known drugs. As used 
IS herein, two agents are said to be administered in combination when the two agents are 
administered simultaneously or are administered independently in a fashion such that the 
agents will act at the same time. 

The agents of the present invention can be administered via parenteral, 
subcutaneous, intravenous, intramuscular, intraperitoneal, transdermal, or buccal routes. 
20 Alternatively, or concurrently, administration may be by the oral route. The dosage 
administered will be dependent upon the age, health, and weight of the recipient, kind of 
concurrent treatment, if any, frequency of treatment, and the nature of the effect desired. 

The present invention further provides compositions containing one or more agents 
which modulate expression or at least one activity of a protein of the invention. While 
25 individual needs vary, determination of optimal ranges of effective amounts of each 
component is within the skill of the art. Typical dosages comprise 0.1 to 100 ^ig/kg body 
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wt. The preferred dosages comprise 0.1 to 10 ^g/kg body wt. The most preferred dosages 
comprise 0.1 to 1 ^g/kg body wt. 

In addition to the pharmacologically active agent, the compositions of the present 
invention may contain suitable pharmaceutically acceptable carriers comprising excipients 
5 and auxiliaries which facilitate processing of the active compounds into preparations 
which can be used pharmaceutically for delivery to the site of action. Suitable 
formulations for parenteral administration include aqueous solutions of the active 
compounds in water-soluble form, for example, water-soluble salts. In addition, 
suspensions of the active compounds as appropriate oily injection suspensions may be 
10 administered. Suitable lipophilic solvents or vehicles include fatty oils, for example, 
sesame oil. or synthetic fatty acid esters, for example, ethyl oleate or triglycerides. 
Aqueous injection suspensions may contain substances which increase the viscosity of the 
suspension include, for example, sodium carboxymethyl cellulose, sorbitol, and/or dextran. 
Optionally, the suspension may also contain stabilizers. Liposomes can also be used to 
1 5 encapsulate the agent for delivery into the cell. 

The pharmaceutical formulation for systemic administration according to the 
invention may be formulated for enteral, parenteral or topical administration. Indeed, all 
three types of formulations may be used simultaneously to achieve systemic 
administration of the active ingredient. 
20 Suitable formulations for oral administration include hard or soft gelatin capsules, 

pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and 
controlled release forms thereof. 

In practicing the methods of this invention, the compounds of this invention may 
be used alone or in combination, or in combination with other therapeutic or diagnostic 
25 agents. In certain preferred embodiments, the compounds of this invention may be 
coadministered along with other compounds typically prescribed for these conditions 
according to generally accepted medical practice. The compounds of this invention can be 
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Utilized in vivo, ordinarily in mammals, such as humans, sheep, horses, cattle, pigs, dogs, 
cats, rats and mice, or in vitro. 

J. Methods to Identify Binding Partners 

Another embodiment of the present invention provides methods for isolating and 
5 identifying binding partners of proteins of the invention. In general, a protein of the 
invention is mixed with a potential binding partner or an extract or fraction of a cell under 
conditions that allow the association of potential binding partners with the protein of the 
invention. After mixing, peptides, polypeptides, proteins or other molecules that have 
become associated with a protein of the invention are separated from the mixture. The 
10 binding partner that bound to the protein of the invention can then be removed and further 
analyzed. To identify and isolate a binding partner, the entire protein, for instance a 
protein comprising the entire amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16 can be used. Alternatively, a fragment of the protein can be used. 

As used herein, a cellular extract refers to a preparation or fraction which is made 
15 from a lysed or disrupted cell. The preferred source of cellular extracts will be cells 
derived from human tumors or transformed cells, for instance, biopsy tissue or tissue 
culture cells from carcinomas. Alternatively, cellular extracts may be prepared from 
normal tissue or available cell lines. 

A variety of methods can be used to obtain an extract of a cell. Cells can be 
20 disrupted using either physical or chemical disruption methods. Examples of physical 
disruption methods include, but are not limited to, sonication and mechanical shearing. 
Examples of chemical lysis methods include, but are not limited to, detergent lysis and 
enzyme lysis. A skilled artisan can readily adapt methods for preparing cellular extracts in 
order to obtain extracts for use in the present methods. 

25 Once an extract of a cell is prepared, the extract is mixed with the protein of the 

invention under conditions in which association of the protein with the binding partner can 
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occur. A variety of conditions can be used, the most preferred being conditions that 
closely resemble conditions found in the cytoplasm of a human cell. Features such as 
osmolarity, pH, temperature, and the concentration of cellular extract used, can be varied 
to optimize the association of the protein with the binding partner. 

5 After mixing under appropriate conditions, the bound complex is separated from 

the mixture. A variety of techniques can be utilized to separate the mixture. For example, 
antibodies specific to a protein of the invention can be used to immunoprecipitate the 
binding partner complex. Alternatively, standard chemical separation techniques such as 
chromatography and density/sediment centrifiigation can be used. 

10 After removal of non-associated cellular constituents found in the extract, the 

binding partner can be dissociated from the complex using conventional methods. For 
example, dissociation can be accomplished by altering the salt concentration or pH of the 
mixture. 

To aid in separating associated binding partner pairs from the mixed extract, the 
15 protein of the invention can be immobilized on a solid support. For example, the protein 
can be attached to a nitrocellulose matrix or acrylic beads. Attachment of the protein to a 
solid support aids in separating peptide/binding partner pairs from other constituents found 
in the extract. The identified binding partners can be either a single protein or a complex 
made up of two or more proteins. Alternatively, binding partners may be identified using 
20 a Far- Western assay according to the procedures of Takayama et al (1997), Methods Mol 
Biol 69: 171-184 or Sauder et al (1996), 7. Gen. Virol 77: 991-996 or identified through 
the use of epitope tagged proteins or GST ftision proteins. 

Alternatively, the nucleic acid molecules of the invention can be used in a yeast 
two-hybrid system or other in vivo protein-protein detection system. The yeast two-hybrid 
25 system has been used to identify other protein partner pairs and can readily be adapted to 
employ the nucleic acid molecules herein described. 
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K. Use of the Binding Partners of the Cancer Associated Proteins 

Once isolated, the binding partners of the proteins of the invention, and 
homologues and analogues thereof, obtained using the above described methods can be 
used for a variety of purposes. The binding partners can be used to generate antibodies 
5 that bind to the binding partner using techniques known in the art. Antibodies that bind the 
binding partner can be used to assay the activity of the protein of the invention, as a 
therapeutic agent to modulate a biological or pathological process mediated by the protein 
of the invention, or to purify the binding partner. These uses are described in detail below. 

L, Methods to Identify Agents that Block the Associations between the Binding 
10 Partners and the Cancer Associated Proteins 

Another embodiment of the present invention provides methods for identifying 
agents that reduce or block the association of a protein of the invention with a binding 
partner. Specifically, a protein of the invention is mixed with a binding partner in the 
presence and absence of an agent to be tested. After mixing under conditions that allow 
15 association of the proteins, the two mixtures are analyzed and compared to determine if 
the agent reduced or blocked the association of the protein of the invention with the 
binding partner. Agents that block or reduce the association of the protein of the invention 
with the binding partner will be identified as decreasing the amount of association present 
in the sample containing the tested agent. 

20 As used herein, an agent is said to reduce or block the association between a 

protein of the invention and a binding partner when the presence of the agent decreases the 
extent to which or prevents the binding partner from becoming associated with the protein 
of the invention. One class of agents will reduce or block the association by binding to the 
binding partner while another class of agents will reduce or block the association by 

25 binding to the protein of the invention. 
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The binding partner used in the above assay can either be an isolated and fully 
characterized protein or can be a partially characterized protein that binds to the protein of 
the invention or a binding partner that has been identified as being present in a cellular 
extract. It will be apparent to one of ordinary skill in the art that so long as the binding 
5 partner has been characterized by an identifiable property, e.g., molecular weight, the 
present assay can be used. 

Agents that are assayed in the above method can be randomly selected or rationally 
selected or designed. As used herein, an agent is said to be randomly selected when the 
agent is chosen randomly without considering the specific sequences involved in the 
10 association of the protein of the invention with the binding partner. An example of 
randomly selected agents is the use of a chemical library or a peptide combinatorial library, 
or a growth broth of an organism. 

As used herein, an agent is said to be rationally selected or designed when the 
agent is chosen on a nonrandom basis which takes into account the sequence of the target 

1 5 site and/or its conformation in connection with the agent's action. Agents can be rationally 
selected or rationally designed by utilizing the peptide sequences that make up the contact 
sites of the binding partner with the protein of the invention. For example, a rationally 
selected peptide agent can be a peptide whose amino acid sequence is identical to the 
contact site of the protein of the invention on the binding partner. Such an agent will 

20 reduce or block the association of the protein of the invention with the binding partner by 
binding to the binding partner. 

The agents of the present invention can be, as examples, peptides, small molecules, 
vitamin derivatives, as well as carbohydrates. A skilled artisan can readily recognize that 
there is no limit as to the structural nature of the agents of the present invention. 

25 One class of agents of the present invention are peptide agents whose amino acid 

sequences are chosen based on the amino acid sequence of the protein of die invention. 
The peptide agents of the invention can be prepared using standard solid phase (or 
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solution phase) peptide synthesis methods, as is known in the art. In addition, the DNA 
encoding these peptides may be synthesized using commercially available oligonucleotide 
synthesis instrumentation and produced recombinantly using standard recombinant 
production systems. The production using solid phase peptide synthesis is necessitated if 
S non-gene encoded amino acids are to be included. 

Another class of agents of the present invention are antibodies immunoreactive 
with critical positions of the protein of the invention or the binding partner. As described 
above, antibodies are obtained by immunization of suitable mammalian subjects with 
peptides, containing as antigenic regions, those portions of the protein of the invention or 
10 the binding partner, intended to be targeted by the antibodies. Critical regions include the 
contact sites involved in the association of the protein of the invention with the binding 
partner. 

As discussed below, the important minimal sequence of residues involved in 
activity of the protein of the invention define a functional linear domain that can be 
15 effectively used as a bait for two hybrid screening and identification of potential 
associated molecules. Use of such fi-agments will significantly increase the specificity of 
the screening as opposed to using the full-length molecule and is therefore preferred. 
Similarly, this linear sequence can be also used as an affinity matrix also to isolate binding 
proteins using a biochemical affinity purification strategy. 

20 M. Uses for Agents that Block the Associations between the Binding Partners and 
the Cancer Associated Proteins 

As provided in the Examples, the proteins and nucleic acids of the invention, such 
as the proteins having the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, 
are differentially expressed in cancerous tissue. Agents that reduce or block the 
25 interactions of a protein of the invention, including those identified employing 
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homologues and analogues of the protein, with a binding partner may be used to modulate 
biological and pathologic processes associated with the protein's function and activity. 

As used herein, a subject can be any mammal, so long as the mammal is in need of 
modulation of a pathological or biological process mediated by a protein of the invention. 
5 The term "mammal" is meant an individual belonging to the class Mammalia. The 
invention is particularly useful in the treatment of human subjects. 

Pathological processes refer to a category of biological processes which produce a 
deleterious effect. For example, expression of a protein of the invention may be 
associated with cell growth or hyperplasia. As used herein, an agent is said to modulate a 
10 pathological process when the agent reduces the degree or severity of the process. For 
instance, cancer may be prevented or disease progression modulated by the administration 
of agents that reduce or block the interactions of a protein of the invention with a binding 
partner. 

The agents of the present invention can be administered via parenteral, 
15 subcutaneous, intravenous, intramuscular, intraperitoneal, transdermal, or buccal routes. 
Alternatively, or concurrently, administration may be by the oral route. The dosage 
administered will be dependent upon the age, health, and weight of the recipient, kind of 
concurrent treatment, if any, frequency of treatment, and the nature of the effect desired. 

The present invention further provides compositions containing one or more agents 
20 that block association of a protein of the invention with a binding partner. While 
individual needs vary, determination of optimal ranges of effective amounts of each 
component is within the skill of the art. Typical dosages comprise 0.1 to 100 jig/kg body 
wt. The preferred dosages comprise 0.1 to 10 jigAcg body wt. The most preferred dosages 
comprise 0.1 to 1 jxg/kg body wt. 
25 In addition to the pharmacologically active agent, the compositions of the present 

invention may contain suitable pharmaceutically acceptable carriers comprising excipients 
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and auxiliaries which facilitate processing of the active compounds into preparations 
which can be used pharmaceutically for delivery to the site of action. Suitable 
formulations for parenteral admmistration include aqueous solutions of the active 
compounds in water soluble form, for example, water soluble salts. In addition, 

5 suspensions of the active compounds as appropriate oily injection suspensions may be 
administered. Suitable lipophilic solvents or vehicles include fatty oils, for example, 
sesame oil, or synthetic fatty acid esters, for example, ethyl oleate or triglycerides. 
Aqueous injection suspensions may contain substances which increase the viscosity of the 
suspension include, for example, sodium carboxymethyl cellulose, sorbitol, and/or dextran. 

10 Optionally, the suspension may also contain stabilizers. Liposomes can also be used to 
encapsulate the agent for delivery into the cell. 

TTie pharmaceutical formulation for systemic administration according to the 
invention may be formulated for enteral, parenteral or topical administration. Indeed, all 
three types of formulations may be used simultaneously to achieve systemic 
IS administration of the active ingredient. 

Suitable formulations for oral administration include hard or soft gelatin capsules, 
pills, tablets, including coated tablets, elixirs, suspensions, syrups or inhalations and 
controlled release forms thereof. 

In practicing the methods of this invention, the compounds of this invention may 
20 be used alone or in combination, or in combination with other therapeutic or diagnostic 
agents. In certain preferred embodiments, the compounds of this invention may be 
coadministered along with other compounds typically prescribed for these conditions 
according to generally accepted medical practice. The compounds of this invention can be 
utilized in vivo, ordinarily in mammals, such as humans, sheep, horses, cattle, pigs, dogs, 
25 cats, rats and mice, or in vitro. 

N. Rational Drug Design and Combinatorial Chemistry 
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The present invention further encompasses rational drug design and combinatorial 
chemistry. Those of skill will recognize appropriate methods to utilize and exploit aspects 
of the present invention in identifying compounds which can be developed for cancer 
treatment. Rational drug design involving polypeptides requires identifying and defining 

5 a first peptide with which the designed drug is to interact, and using the first target peptide 
to define the requirements for a second peptide. With such requirements defined, one can 
find or prepare an appropriate peptide or non-peptide that meets all or substantially all of 
the defined requirements. Thus, one goal of rational drug design is to produce structural or 
fimctionai analogs of biologically active polypeptides of interest or of small molecules 

10 with which tiiey interact (e.g., agonists, antagonists, null compounds) in order to fashion 
dmgs that are. for example, more or less potent forms of the ligand. (See, e.g., Hodgson 
(1991), Bio. Technology 9:19-21). Combinatorial chemistry is the science of synthesizing 
and testing compounds for bioactivity en masse, instead of one by one, tiie aim being to 
discover drugs and materials more quickly and inexpensively tiian was formeriy possible. 

1 5 Rational drug design and combinatorial chemistry have become more intimately related in 
recent years due to the development of approaches in computer-aided protein modeling 
and drug discovery. (See e.g., US Pat. No. 4,908.773; 5,884,230; 5.873,052; 5,331,573; 
and 5,888,738). 

The use of molecular modeling as a tool for rational drug design and combinatorial 
20 chemistry has dramatically increased due to the advent of computer graphics. Not only is 
it possible to view molecules on computer screens in three dimensions but it is also 
possible to examine tiie interactions of macromolecules such as enzymes and receptors 
and rationally designed derivative molecules to test. (See Doorman (1992), Chem. Eng. 
News 70:18-26). A vast amount of user-firiendly software and hardware is now available 
25 and virtually all pharmaceutical companies have computer modeling groups devoted to 
rational drug design. Molecular Simulations Inc. (www.msi.com), for example, sells 
several sophisticated programs that allow a user to start firom an amino acid sequence, 
build a two or tiiree-dimensional model of the protein or polypeptide, compare it to other 
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two and three-dimensional models, and analyze the interactions of compounds, drugs, and 
peptides with a three dimensional model in real time. Accordingly, in some embodiments 
of the invention, software is used to compare regions of the invention protein and 
molecules that interact therewith (collectively referred to as "binding partners" -e.g., anti- 

5 protein antibodies), and fragments or derivatives of these molecules with other molecules, 
such as peptides, peptidomimetics, and chemicals, so that therapeutic interactions can be 
predicted and designed. (See Schneider (1998), Genetic Engineering News December: 
page 20; Tempczyk et al (1997), Molecular Simulations Inc. Solutions April; and 
Butenhof (1998), Molecular Simulations Inc. Case Notes (August 1998) for a discussion 

1 0 of molecular modeling). 

O. Gene Therapy 

In another embodiment, genetic therapy can be used as a means for modulating 
biological and pathologic processes associated with the protein's function and activity. 
This comprises inserting into a cancerous cell a gene construct encoding a protein 

15 comprising all or at least a portion of the sequences of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 
or 16, or alternatively a gene construct comprising all or a portion of the non-coding 
region of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, operably linked to a promoter or 
enhancer element such that expression of said protein causes suppression of said cancer 
and wherein said promoter or enhancer element is a promoter or enhancer element 

20 modulating said gene construct. 

In the constructs described, expression of said protein can be directed from any 
suitable promoter (e.g., the human cytomegalovirus (CMV), simian virus 40 (SV40), or 
metallothionein promoters), and regulated by any appropriate mammalian regulatory 
element. For example, if desired, enhancers known to preferentially direct gene expression 
25 in neural cells, T cells, or B cells may be used to direct the expression. The enhancers used 
could include, without limitation, those that are characterized as tissue or cell specific in 
their expression. Alternatively, if a genomic clone of LFGl, LFG2, LFG3, LFG4, LFG5 
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or LFG6 is used as a therapeutic construct (for example, following its isolation by 
hybridization with the nucleic acid molecule of the invention described above), regulation 
may be mediated by the cognate regulatory sequences or, if desired, by regulatory 
sequences derived from a heterologous source, including any of the promoters or 
S regulatory elements described above. 

Insertion of the construct into a cancerous cell is accomplished in v/vo, for 
example using a viral or plasmid vector. Such methods can also be applied to in vitro uses. 
Thus, the methods of the present invention are readily applicable to different forms of 
gene therapy, either where cells are genetically modified ex vivo and then administered to 
10 a host or where the gene modification is conducted in vivo using any of a number of 
suitable methods involving vectors especially suitable to such therapies. 

Retroviral vectors, adenoviral vectors, adeno-associated viral vectors, or other viral 
vectors with the appropriate tropism for cells likely to be involved in cancer (for example, 
epithelial cells) may be used as a gene transfer delivery system for a therapeutic gene 

15 construct. Numerous vectors useful for this purpose are generally known (Cozzi PJ, et al., 
(2002) Prostate, 53(2):95-100; Bitzer M, Lauer U., (2002) Dtsch Med Wochenschr 
127(31-32):1623-1624; Mezzina and Danos (2002), Trends Genet. 8:241-256; Loser et al. 
(2002) CurK Gene Ther. 2:161-171; Pfeifer and Verma (2001), Annu. Rev. Genomics Hum, 
Genet. 2:177-211). Retroviral vectors are particularly well developed and have been used 

20 in clinical settings (Anderson et al. (1995), U.S. Patent No. 5,399,346). Non-viral 
approaches may also be employed for the introduction of therapeutic DNA into cells 
otherwise predicted to undergo cancer (Jeschke et al. (20002) Curr. Gene Ther. 1 :267- 
278; Wu et al. (1988), J. Biol. Chem. 263:14621-14624; Wu et al. (1989), J. Biol Chem. 
264:16985-16987). For example, a gene may be introduced into a neuron or a T cell by 

25 lipofection, asialorosonucoid polylysine conjugation, or, less preferably, microinjection 
under surgical conditions. 
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For any of the methods of application described above, the therapeutic nucleic acid 
construct is preferably applied to the site of the cancer event (for example, by injection). 
However, it may also be applied to tissue in the vicinity of the cancer event or to a blood 
vessel supplying the cells predicted to undergo cancer. 

S P. Transgenic Animals 

Transgenic animals containing mutant, knock-out or modified genes corresponding 
to the cDNA sequence of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, or the open reading 
frame encoding the polypeptide sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, or 
fragments thereof having a consecutive sequence of at least about 3, 4, 5, 6, 10, 15, 20, 25, 

10 30, 35 or more amino acid residues, are also included in the invention. Transgenic 
animals are genetically modified animals into which recombinant, exogenous or cloned 
genetic material has been experimentally transferred. Such genetic material is often 
referred to as a **transgene." The nucleic acid sequence of the transgene, in this case a 
form of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, may be integrated either at a locus of a 

15 genome where that particular nucleic acid sequence is not otherwise normally found or at 
the normal locus for the transgene. The transgene may consist of nucleic acid sequences 
derived from the genome of the same species or of a different species than the species of 
the target animal. 

In some embodiments, transgenic animals in which all or a portion of a gene 
20 comprising SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or 15 is deleted may be constructed. In those 
cases where the gene corresponding to SEQ ID NO: 1, 3, 5. 7, 9, 1 1, 13 or 15 contains one 
or more introns, the entire gene- all exons, introns and the regulatory sequences- may be 
deleted. Alternatively, less than the entire gene may be deleted. For example, a single 
exon and/or intron may be deleted, so as to create an animal expressing a modified version 
25 of a protein of the invention. 
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The tem "germ cell line transgenic animal" refers to a transgenic animal in wh.ch 
the genetic alteration or genetic information was introduced into a germ line cell, thereby 
conferring the ability of the transgenic animal to transfer the genetic information to 
offspring. If such offspring in feet possess some or all of that alteration or genetic 
information, then they too are transgenic animals. 

The alteration or genetic information may be foreign to the species of animal to 
which the recipient belongs, foreign only to the particular individual recipient, or may be 
genetic information already possessed by the recipient. In the last case, the altered or 
introduced gene may be expressed differently than the native gene. 

Transgenic animals can be produced by a variety of different methods including 
transfection. electroporation. microinjection, gene targeting in embryonic stem cells and 
recombinant viral and retroviral infection e.g., U.S. Patent No. 4.736.866; U.S. 
Patent No. 5.602,307; MuUins et al. (1993). Ifypemnsion 22: 630-633; Brenin et al. 
(1997). Surg. Oncol. 6: 99-110; Peromhinant Gene FxprecHon Protocols (Methods m 
p^.J...,Riol.av Vol.62\ Tuan. ed., Humana Press. Totowa. NJ, 1997). 

A number of recombinant or transgenic mice have been produced, including those 
which express an activated oncogene sequence (U.S. Patent No. 4.736.866); express 
simian SV40 T-antigen (U.S. Patent No. 5.728.915); lack the expression of interferon 
regulatory factor 1 (IRF-1) (U.S. Patent No. 5.731.490); exhibit dopaminergic dysfunction 
(U S Patent No. 5.723.719); express at least one human gene which participates m blood 
pressure control (U.S. Patent No. 5.731.489); display greater similarity to the conditions 
existing in naturally occurring Alzheimer's di^ase (U.S. Patent No. 5.720,936); have a 
reduced capacity to mediate cellular adhesion (U.S. Patent No. 5,602,307); possess a 
bovine growth hormone gene (Clutter et al (1996), Genetics 143: 1753-1760); or, are 
capable of generating a fully human antibody response (McCarthy (1997). Uncet 349: 
405). 
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While mice and rats remain the animals of choice for most transgenic 
experimentation, in some instances it is preferable or even necessary to use alternative 
animal species. Transgenic procedures have been successfully utilized in a variety of non- 
murine animals, including sheep, goats, pigs, dogs. cats, monkeys, chimpanzees, hamsters, 
5 rabbits, cov^ and guinea pigs (see, e.g.. Kim et al (1997). Mol. Reprod Dev. 46: 5 1 5-526; 
Houdebine (1995). Reprod. Nutr. Dev. 35: 609-617; Fetters (1994). Reprod. Fertil. Dev. 6: 
643-645; Schnieke et al. (1997). Science 278: 2130-2133; and Amoah (1997), J. Animal 
Set. 75: 578-585). 

The method of introduction of nucleic acid fragments into recombination 
10 competent mammalian cells can be by any method which favors co-transformation of 
multiple nucleic acid molecules. Detailed procedures for producing transgenic animals 
are readily available to one skilled in the art, including the disclosures in U.S. Patent No. 
5,489,743 and U.S. Patent No. 5,602.307. 

Q. Diagnostic Methods 

1 5 As the genes and proteins of the invention are differentially expressed in cancerous 

tissues compared to non-cancerous tissues, the genes and proteins of the invention may be 
used to diagnose or monitor cancer, to track disease progression, or to differentiate 
cancerous tissue from non-cancerous tissue samples. One means of diagnosing cancer 
using the nucleic acid molecules or proteins of the invention involves obtaining tissue 
20 from living subjects. 

Assays to detect nucleic acid or protein molecules of the invention may be in any 
available format. Typical assays for nucleic acid molecules include hybridization or PGR 
based formats. Typical assays for the detection of proteins, polypeptides or peptides of 
the invention include the use of antibody probes in any available format such as in situ 
25 binding assays, etc. (see Harlow & Lane. Antibodies - A Laboratory Manual, Cold Spring 
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Harbor Laboratory Press. Cold Spring Harbor, NY. 1988). In preferred embodiments, 
assays are carried-out with appropriate controls. 

Generally, the diagnostics of the invention can be classified according to whether 
the embodiment is a nucleic acid or protein-based assay. Some diagnostic assays detect 

5 mutations or polymorphisms in the invention nucleic acids or proteins, which contribute to 
cancerous aberrations. Other diagnostic assays identify and distinguish defects in protein 
activity by detecting a level of invention RNA or protein in a tested organism that 
resembles the level of invention RNA or protein in a organism suffering from a disease, 
such as cancer, or by detecting a level of RNA or protein in a tested organism that is 

1 0 different than an organism not suffering from a disease. 

Additionally, the manufacture of kits that incorporate the reagents and methods 
described in the following embodiments so as to allow for the rapid detection and 
identification of aberrations in protein activity or level are contemplated. The diagnostic 
kits can include a nucleic acid probe or an antibody or combinations thereof, which 

15 specifically detect a mutant form of the invention protein or a nucleic acid probe or an 
antibody or combinations thereof, which can be used to determine the level of RNA or 
protein expression of one or more invention protein. The detection component of these 
kits will typically be supplied in combination with one or more of the following reagents. 
A support capable of absorbing or otherwise binding DNA, RNA, or protein will often be 

20 supplied. Available supports include membranes of nitrocellulose, nylon or derivatized 
nylon that can be characterized by bearing an array of positively charged substituents. One 
or more restriction enzymes, control reagents, buffers, amplification enzymes, and non- 
human polynucleotides like calf-thymus or salmon-sperm DNA can be supplied in these 
kits. 

25 Useful nucleic acid-based diagnostic techniques include, but are not limited to, 

direct DNA sequencing, gradient gel electrophoresis. Southern Blot analysis, single- 
stranded confirmation analysis (SSCA), RNAse protection assay, dot blot analysis, nucleic 
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acid amplification, allele-specific PGR and combinations of these approaches. The starting 
point for these analyses is isolated or purified nucleic acid from a biological sample. It is 
contemplated that tissue biopsies would provide a good sample source. The nucleic acid 
is extracted from the sample and can be amplified by a DNA amplification technique such 
as the Polymerase Chain Reaction (PCR) using primers. Those of skill in the art will 
readily recognize methods available for confirming the presence of polymorphisms. In 
addition, any addressable array technology known in the art can be employed with this 
aspect of the invention. One particular embodiment of polynucleotide arrays is known as 
Genechips™, and has been generally described in US Patent 5,143,854; PCT publications 
WO 90/15070 and 92/10092. 

A wide variety of labels and conjugation techniques are known by those skilled in 
the art and can be used in various nucleic acid assays. There are several ways to produce 
labeled nucleic acids for hybridization or PCR including, but not limited to, oligolabeling, 
nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
1 5 Alternatively, a nucleic acid encoding an invention protein can be cloned into a vector for 
the production of an mRNA probe. Such vectors are known in the art, are commercially 
available, and can be used to synthesize RNA probes in vitro by addition of an appropriate 
RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. A number of companies 
such as Pharmacia Biotech (Piscataway, NJ), Promega (Madison, WI), and U.S. 
20 Biochemical Corp (Cleveland, OH) supply commercial kits and protocols for these 
procedures. Suitable reporter molecules or labels include those radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as well as, substrates, cofactors, 
inhibitors, magnetic particles and the like. 

In preferred protein-based diagnostic, antibodies of the invention are attached to a 
25 support in an ordered array wherein a plurality of antibodies are attached to distinct 
regions of the support that do not overlap with each other. Those of skill in the art will 
readily recognize available assays that are protein-based diagnostics. Proteins are 
obtained from biological samples and are labeled by conventional approaches (e.g., 
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radioactivity, colorimetrically, or fluorescently). Employing labeled standards of a known 
concentration of mutant and/or wild-type invention protein, an investigator can accurately 
determine the concentration of the invention protein in a sample and from this information 
can assess the expression level of the particular form of the protein. Conventional methods 

5 in densitometry can also be used to more accurately determine the concentration or 
expression level of such protein. These approaches are also easily automated using 
technology known to those of skill in the art of high throughput diagnostic analysis. As 
detailed above, any addressable array technology known in the art can be employed with 
this aspect of the invention and display the protein arrays on the chips in an attempt to 

10 maximize antibody binding patterns and diagnostic information. 

As discussed above, the presence or detection of a polymorphism in an invention 
gene or protein can provide a diagnosis of a cancer or similar malady in an organism. 
Additional embodiments include the preparation of diagnostic kits comprising detection 
components, such as antibodies, specific for a particular polymorphic variant of invention 

15 gene or protein. The detection component will typically be supplied in combination with 
one or more of the following reagents. A support capable of absorbing or otherwise 
binding RNA or protein will often be supplied. Available supports for this purpose include, 
but are not limited to, membranes of nitrocellulose, nylon or derivatized nylon that can be 
characterized by bearing an array of positively charged substituents, and Genechips™ or 

20 their equivalents. One or more enzymes, such as Reverse Transcriptase and/or Taq 
polymerase, can be furnished in the kit, as can dNTPs, buffers, or non-human 
polynucleotides like calf-thymus or salmon-sperm DNA. Results from the kit assays can 
be interpreted by a healthcare provider or a diagnostic laboratory. Alternatively, diagnostic 
kits are manufactured and sold to private individuals for self-diagnosis. 

25 In addition to diagnosing disease according to the presence or absence of a 

polymorphism, some diseases involving cancer result from skewed levels of invention 
protein or gene in particular tissues or aberrant patterns of invention protein expression. 
By monitoring the level of expression in various tissues, for example, a diagnosis can be 
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made or a disease state can be identified. Similarly, by determining ratios of the level of 
expression of various invention proteins in specific tissues (e.g., patterns of expression) a 
prognosis of health or disease can be made. The levels of invention protein expression in 
various tissues from healthy individuals, as well as, individuals suffering from cancers is 

5 determined. These values can be recorded in a database and can be compared to values 
obtained from tested individuals. Additionally, the ratios or patterns of expression in 
various tissues from both healthy and diseased individuals is recorded in a database. These 
analyses are referred to as "disease state profiles" and by comparing one disease state 
profile (e.g. from a healthy or diseased individual) to a disease state profile from a tested 

10 individual, a clinician can rapidly diagnose the presence or absence of disease. 

The nucleic acid and protein-based diagnostic techniques described above can be 
used to detect the level or amount or ratio of expression of invention genes or proteins in a 
tissue. Through quantitative Northern hybridizations, in situ analysis, 
immunohistochemistry, ELISA, genechip array technology, PGR, and Western blots, for 
15 example, the amount or level of expression of RNA or protein for a particular invention 
protein (wild-type or mutant) can be rapidly determined and from this information ratios 
of expression can be ascertained. Alternatively, the invention proteins to be analyzed can 
be family members that are currently unknown but which are identified based on their 
possession of one or more of the homology regions described above. 

20 Without further description, it is believed that one of ordinary skill in the art can, 

using the preceding description and the following illustrative examples, make and utilize 
the compounds of the present invention and practice the claimed methods. The following 
working examples therefore, specifically point out preferred embodiments of the present 
invention, and are not to be construed as limiting in any way the remainder of the 

25 disclosure. 

EXAMPLES 
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p yAMPT.R 1: Identification of Di ffe rentially Expi^sed mRNA in Cancers - 1 

Global changes in gene expression between tumor biopsies and normal tissues 
have been examined using the GeneExpress Oncology Datasuite™ of Gene Logic, Inc. 
(Gaithersburg, MD). The database includes the gene expression profiles, generated by 
5 using the Affymetrix Human Genome U95 array, derived from normal and cancer tissue 
samples from many different organs. Among the tissue samples in the database, applicants 
analyzed the expression profiles of normal and cancer tissue sets from breast, colon, 
esophagus, kidney, liver, lung, lymph node, ovary, pancreas, prostate, rectum, and 
stomach. 

10 The Affymetrix Human Genome U95 array contains 63,175 probe sets. A probe 

set is a set of probes to detect one transcript (a gene or a cDNA clone), and usually 
consists of 16-20 oligonucleotide probe pairs. These probe pairs include perfectly matched 
sets and mismatched sets, both of which are necessary for the calculation of average 
difference. Average difference serves as a relative indicator of the level of expression of a 

15 transcript and is a measure of the intensity difference for each probe pair, calculated by 
subtracting the intensity of the mismatch from the intensity of the perfect match. This 
takes into consideration variability in hybridization among probe pairs and other 
hybridization artifacts that could affect the fluorescence intensities. Using the average 
difference value that has been calculated, an absolute call for each gene is made; "Absent" 

20 (= not detected), "Present" (= detected) or "Marginal" (= not clearly Absent or Present). 

Differential expression of genes between cancerous and normal tissue samples was 
determined with the following statistical methods. (1) For each probe set, average 
difference values and absolute calls were determined by Affymetrix Microarray Suite 
(v4.0). (2) In a given sample set, outliers among the tissue samples were detected by 
25 Principal Component Analysis (PCA) using MatLab program (The MathWorks, Inc.. 
Natick, MA). The data points used in the PCA were the average differences of randomly 
selected probe sets (5,000-6,000 probe sets). Outliers were excluded from further analysis. 
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(3) Variations of gene expression were analyzed by using the Fold Change Analysis tool 
of GeneExpress program. The fold change (cancerous/normal) was calculated by 
comparing the mean average difference for each gene in a cancerous sample set against 
the mean average difference of that gene in the normal tissue sample set Genes showing 
5 at least 3-fold increases or decreases in expression level were obtained. Genes were 
included in the analysis if they had a p-value of less than or equal to 0.05 as determined by 
an Analysis of Variance Test (Steel et a/., Principles and Procedu res of Statistics: A 
Biometrical Approach. Third Ed., McGraw-Hill, 1997). (4) Genes showing differential 
expression in at least 5 different cancer types were selected. 

10 Analysis of the chip data showed that the expression of the marker LFGl was 

significantly up-regulated in cancer tissue samples compared to samples from normal 
tissue. The expression level of LFGl (SEQ ID NO: 1 or 3) can be measured by chip 
sequence fragment no. 91875_s_at on Affymetrix GeneChips® U95. The 91875_s_at 
sequence is derived from the EST AI053741. The expression levels of 91875_s _at in 

15 various malignant neoplasms, compared to normal control tissues, are shown in Table 1, 
where the fold-change, the direction of the change (up- or down-regulation), p-value are 
also indicated. The fold change (cancerous/normal) was calculated by comparing the 
geometric mean of average difference in a cancerous sample set against the geometric 
mean of average difference in the normal tissue sample set. A fold change greater than 1.5 

20 was considered to be significant (Wodicka et al (1997), Nature Biotech. 15:1359-1367). 
Also indicated in the Table 1 are, for each tissue type, the numbers of samples that are 
called present, absent, or marginal together with the total number of samples in that 
sample set. These data indicate that up-regulation of LFGl may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fragment no. 91875_s _at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PGR primers (5'-GCTGAAGCAGGAAAATCGCTT-3' 
(SEQ ID NO: 17) and S'-TGAGAGGGAGTGTGAGTCGGT-S' (SEQ ID NO: 18)) 
designed based on the sequence information file of the specific Afiymetrix fragment 
(91875_s _at) were used in the assay. The target gene in each RNA sample (10 ng of total 
RNA) was assayed relative to a reference gene. For this purpose, primers (5'- 
GTrnTGGTAATTTTGGCATGAAG-3' (SEQ ID NO: 19) and 5'- 
GGGCCAAGGTTTTGCrnT-3' (SEQ ID NO: 20)) specific to the GTBPl gene (G- 
terminal binding protein 1) were used to serve as control primers. This approach provides 
the relative expression as measured by cycle threshold (a) value of the target mRNA 
relative to an amount of GTBPl Gt value. The sample panel included total RNA pairs of 
normal and tumor tissues from colon, kidney, liver, lung, ovary, stomach and pancreas 
(Ambion. Inc., Austin, TX). The Q-RT-PCR data confirms the up-regulation of LFGl in 
cancer compared to normal samples. 

RVAMPI.K 2- Tinning of F -H-T pnrth Human cDNA (LFGl) Corresponding to 
Differentiallv Expres sed mRNA Species 

The fiiU-length cDNA having SEQ ID NO: 1 or 3 was obtained by polymerase chain 
reaction (PGR) and rapid amplification of cDNA ends (RAGE) using cDNA library from 
human heart (ResGen, Huntsville, AL). Gene-specific oligos for PGR (5*- 
GAGCCnTGGCTGTGTGAGTTGGGGA-3' (SEQ ID NO: 21), 5*- 
GCTGGAGCACGAGGAGTGGATTG-3' (SEQ ID NO: 22), 5'- 
GGAGCTGAGCAGGAGTGTAATGAA-3' (SEQ ID NO: 23). 5'- 
GAGGCCTGGCTGAAGGAGGAGGTTC-3' (SEQ ID NO: 24). 5' 
TGTGGAAGTAGTGCAGACGGCTGAGG-3' (SEQ ID NO: 25). 5' 
AGGCAAGGTGGGGTTTGTTATGGAGG-3' (SEQ ID NO: 26). 5' 



51 



wo 2004/035789 




'PCT/KR2003/002161 



GCTGTCAGATATGATGGTTCTGGAC-3' 


(SEQ 


ID 


NO: 


27), 


5'- 


CCAGCCTCACCACTGTTGGGTTGC-3' 


(SEQ 


ID 


NO: 


28), 


5'- 


CATTCTCTGAGCTGTATTAGTGT-3 ' 


(SEQ 


ID 


NO: 


29), 


5'- 


CCTGAGCTGGAATGACCTGCA-3' 


(SEQ 


ID 


NO: 


30), 


5'- 


CTTTGTGTTGGCTGC AGCC AC A-3 ' 


(SEQ 


ID 


NO: 


31), 


5'- 


TGAGGAGAGACTTTGCTGACTGGT-3' 


(SEQ 


ED 


NO: 


32). 


5'- 


GTCCTGTCTGGCGGTGCCGA-3' (SEQ 


ID 


NO: 


33). 


5'- 



GCTCCAGGATCCCCTGTCACCTGGGCCTTCTGCCTTTTGGCT-S' (SEQ ID NO: 34). 
5'-CCATATGGAGAGGAGAGCAGCGGGCCCA-3' (SEQ ID NO: 35). 5'- 

10 GAAGGAGGAACATGGAGAGGAGA-3' (SEQ ID NO: 36). 5'- 
CCATATGCCCCGGGTAGTCTACTGCAT-3' (SEQ ID NO: 37), and 5'- 
GTCGACTCGAGTCACTTCCGCAAAAACTTCTTG-3' (SEQ ID NO: 38)) and RACE 
(5'-TCCATTCCGAAGGCTCTCCTCC-3' (SEQ ID NO: 39), 5'- 
GTCTGTGTGACGGAAATGTAAGC-3' (SEQ ID NO: 40), and 5*- 

15 GAAGGTCGAAGGCAGACCGATGT-3' (SEQ ID NO: 41)) were designed based on 
predicted genes containing the 91875_s_at sequence using Human Genome Browser 
(University of California, Santa Cruz). The amplified products with the primers were 
incorporated into PCR4-Topo vector using Topo Cloning System (Invitrogen, Carlsbad. 
CA), and followed by sequencing. 

20 The nucleotide sequence of the full-length human cDNAs corresponding to the 

differentially regulated mRNA detected above is set forth in SEQ ID NOS: 1 and 3. In the 
former, the cDNA comprises 5293 base pairs. In the latter, the cDNA comprises 5317 base 
pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 1, at 
25 nucleotides 390-4880 (390-4883 including the stop codon), encodes a protein of 1497 
amino acids. The amino acid sequence corresponding to a predicted protein encoded by 
SEQ ID NO: 1 is set forth in SEQ ID NO: 2. Figure 2 shows the results of a hydrophobicity 
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analysis of the amino acid sequence of SEQ ID NO: 2 using Kyte-Doolittie values (Kyte 
and Doolittle (1982), J. Mol. Biol. 157:105-142). Hydrophilic regions may be used to 
produce antigenic peptides, as described above. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 3, at 
5 nucleotides 12-4904 (12-4907 including the stop codon), encodes a protein of 1631 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 3 is set forth in SEQ ID NO: 4. Figure 3 shows the results of a hydrophobicity analysis 
of the amino acid sequence of SEQ ID NO: 4 using Kyte-Doolittle values (Kyte and 
Doolittle (1982). J. Mol. Biol. 157:105-142). Hydrophilic regions may be used to produce 
1 0 antigenic peptides, as described above. 

The protein sequence of SEQ ID NO: 2 is identical to that of SEQ ID NO: 4, except 
that SEQ ID NO: 2 lacks the first 134 amino acids at the N-terminus of SEQ ID NO: 4. 

SEQ ID NOS: 2 and 4 contain Calponin homology domain (amino acid positions 38- 
145 of SEQ ID NO: 4), IQ domain for calmodulin-binding (amino acid positions 629-646 of 
15 SEQ ID NO: 2 and amino acid positions 763-780 of SEQ ID NO: 4), RasGAP domain 
(amino acid positions 858-1195 of SEQ ID NO: 2 and amino acid positions 992-1329 of 
SEQ ID NO: 4), and RasGAP C-terminal domain (amino acid positions 1298-1421 of SEQ 
ID NO: 2 and amino acid positions 1432-1 555 of SEQ ID NO: 4). SEQ ID NOS: 2 and 4 
are similar to IQGAP proteins (Weissbach et al. {\99A), J Biol Chem 269:20517-20521; 
20 Brill et al. (1996), Mol Cell Biol 16:4869-4878). IQGAP binds to and modulate the 
function of proteins involved in cytoskeletal structure, cell-cell adhesion, and proliferation 
signaling (Fukada et al. (2002), Cell 109: 1-20; Briggs et al. (2002), J Biol Chem 277: 7453- 
7465; McCallum et al. (1998). J Biol Chem 273: 22537-22544). IQGAP 1-def.ciem mice 
exhibited a significant increase in late-onset gastric hyperplasia relative to wild-type (Li et 
25 al (2000), Mol Cell Biol 20: 697-701). 
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Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFGl. A Northern blot containing total RNAs from various 
human tissues was used (Human 12-Lane MTN Blot, Clontech, Palo Alto, CA), and an EST 
containing 91875_s _at sequence was radioactively labeled by the random primer method 
5 and used to probe the blot. The blot was hybridized in 50% formamide. 5X SSPE, 0.1% 
SDS. 5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42»C and washed with 
0.2X SSC containing 0.1% SDS at room temperature. The Northern blot showed three 
transcripts for this gene, which are approximately 7.2 kb, and 6.3 kb in size. This 
corresponds to the sizes of the LFGl clones (SEQ ID NO: 1 and 3). 

10 F.XAMPLE 3: Identification of Differe n tially Expressed mRNA in Cancers - 2 

The process in EXAMPLE 1 was repeated except that the marker LFG2 was used 
instead of the marker LFG 1 . 

Analysis of the chip data showed that the expression of the marker LFG2 was 
significantly down-regulated in cancer tissue samples compared to samples from normal 
15 tissue. The expression level of LFG2 (SEQ ID NO: 5) can be measured by chip sequence 
fragment no. 82941_at on Affymetrix GeneChips® U95. The 82941_at sequence is derived 
from the EST AI277612. The expression levels of 82941_at in various malignant neoplasms, 
compared to normal control tissues, are shown in Table 2, where the fold-change, the 
direction of the change (up- or down-regulation), p-value are also indicated. The fold 
20 change (cancerous/normal) was calculated by comparing the geometric mean of average 
difference in a cancerous sample set against the geometric mean of average difference in the 
normal tissue sample set. A fold-change greater than 1.5 was considered to be significant 
(Wodicka et al. (1997). Nature Biotech. 15:1359-1367). Also indicated in the Table 2 are, 
for each tissue type, the numbers of samples that are called present, absent, or marginal 
25 together with the total number of samples in that sample set These data indicate that down- 
legulation of LFG2 may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fragment no, 82941_at, were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PGR primers (5'- 

GAATGTGTCAGAGAGAAGTGGAGG-3' (SEQ ID NO: 42) and 5'- 

5 TGTAGAAACTCTTGGAGTAATGGAGG-3' (SEQ ID NO: 43)) designed based on the 
sequence infonnation file of the EST containing the Affymetrix fragment (82941_at) were 
used in the assay. The target gene in each RNA sample (10 ng of total RNA) was assayed 
relative to a reference gene. For this purpose, primers (5'- 

GTTTTTCCTAATTTTGGCATGAAC-3' (SEQ ID NO: 19) and 5»- 

10 CGCCCAAGCTTTTCCTTTT-3* (SEQ ID NO: 20)) specific to the CTBPl gene (C- 
terminal binding protein 1) were used to serve as control primers. This approach provides 
the relative expression as measured by cycle direshold (Ct) value of the target mRNA 
relative to an amount of CTBPl Ct value. The sample panel included total RNA pairs of 
normal and tumor tissues from colon, liver, lung, ovary, and stomach (Ambion, Inc., Austin, 

15 TX). The Q-RT-PCR data confirms the down-regulation of LFG2 in cancer compared to 
normal samples. 

FXAMPLE 4: Cloning of Full-Leneth Human cDNA fLFG2 ^ Corresponding to 
DifTerentiallv Expressed mRNA Species 

The full-length cDNA having SEQ ID NO: 5 was obtained by the oligo-puUing 
20 method using the GeneTrapper assay (Life Technologies, Rockville, MD). Briefly, a gene- 
specific oligo (5'-GAATGTGTCAGAGACAAGTGCAGC-3* (SEQ ID NO: 42)) was 
designed based on tfie sequence of the EST containing 82941_at sequence. The oligo was 
labeled with biotin and used to hybridize with 5 |ig of single strand plasmid DNA (cDNA 
recombinants) ftom a poorly differentiated stomach adenocarcinoma library (NCI CGAP 
25 Gas4) (ResGen, Huntsville, AL) following the procedures of Sambrook et al The 
hybridized cDNAs were separated by streptavidin-conjugated beads and eluted by heating. 



56 



wo 2004/035789 




PCT/KR2003/002161 



The eluted cDNA was converted to double strand plasmid DNA and used to transform E. 
coli cells (DHIOB) and the longest cDNA was screened. After positive selection was 
confirmed by PGR using gene-specific primers, the cDNA clone was subjected to DNA 
sequencing. 

5 The nucleotide sequence of the full-length human cDNAs corresponding to the 

differentially regulated mRNA detected above is set forth in SEQ ID NO: 5. The cDNA 
comprises 3608 base pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 5, at 
nucleotides 424-1908 (424-191 1 including the stop codon), encodes a protein of 495 amino 
10 acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 5 is set forth in SEQ ID NO: 6. 

SEQ ID NO: 6 has homology to scavenger receptors, which are involved in 
endocytosis of selected polyanionic ligands, phagocytosis of apoptotic cells and bacteria, 
cell adhesion, and development of atherosclerosis (Peiser et al (2002), Curr. Opin. 

15 Immunol. 14:123-128; Resnick et al. (1994), Trends Biol ScL 19:5-8). Based on published 
studies of scavenger receptors, SEQ ID NO: 6 contains a cytoplasmic domain (amino acid 
positions 1-35), a transmenbrane domain (amino acid positions 36-58), an a-helical coiled- 
coil domain (amino acid positions 90-301), a collagen-like domain (amino acid positions 
305-380), and a scavenger receptor cystein-rich (SRCR) domain (amino acid positions 393- 

20 493). The SRCR domain contains six cysteine residues (amino acid positions 418, 431, 462, 
472, 482, and 492), which may participate in intradomain disulfide bonds. SEQ ID NO: 6 
also exhibits homology to a mouse homologue (GenBank Accession No. BCO 16096). It 
shows 70% identity over the entire contiguous sequence. 

Figure 4 shows the results of a hydrophobicity analysis of the amino acid sequence 
25 of SEQ ID NO: 6 using Kyte-Doolittle values (Kyte and Doolittle ( 1 982), J. Mol Biol 
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157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
above. 

Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG2. A Northern blot containing total RNAs from various 
5 human tissues was used (Human MTN Blot, Clontech, Palo Alto, CA), and the EST 
containing 82941__at sequence was radtoactively labeled by the random primer method and 
used to probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0.1% SDS, 
5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42°C and washed with 0.2X 
SSC containing 0.1% SDS at room temperature. The Northern blot showed a single 
10 transcript for this gene, which is approximately 3.7 kb in size. This corresponds to the size 
of the LFG2 clone (SEQ ID NO: 5). 

EXAMPLE 5: Identification of Differentiallv Expressed mRNA in Cancers - 3 

The process in EXAMPLE 1 was repeated except that the marker LFG3 was used 
instead of the marker LFG 1 . 

IS Analysis of the chip data showed that the expression of the marker LFG3 was 

significantly down-regulated in cancer tissue samples compared to samples from normal 
tissue. The expression level of LFG3 (SEQ ID NO: 7) can be measured by chip sequence 
fragment no. 46104_at on Affymetrix GeneChips® U95. The 46104_at sequence is derived 
from the EST AA772055. The expression levels of 46104_at in various malignant 

20 neoplasms, compared to normal control tissues, are shown in Table 3, where the fold- 
change, the direction of the change (up- or down-regulation), p-value are also indicated. The 
fold change (cancerous/normal) was calculated by comparing the geometric mean of 
average difference in a cancerous sample set against the geometric mean of average 
difTerence in the normal tissue sample set. A fold-change greater than 1.5 was considered to 

25 be significant (Wodicka et al. (1997), Nature Biotech. 15:1359-1367). Also indicated in the 
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Table 3 are, for each tissue type, the numbers of samples that are called present, absent, or 
marginal together with the total number of samples in that sample set. These data indicate 
that down-regulation of LFG3 may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fragment no. 46104 at. were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PGR primers (5'- 

GTATGCATCAGAATTCCCTATAGATCTTr.3' (SEQ ID NO: 44) and 5'- 
5 TAGATGnTGGGCAACAGCCT-3- (SEQ ID NO: 45)) designed based on the sequence 
information file of the EST containing the Affymetrix fragment (46104_at) were used in the 
assay The target gene in each RNA sample (10 ng of total RNA) was assayed relative to a 
reference gene. For this purpose, primers (5'-GTnTrCCTAATnTGGCATGAAC-3' 
(SEQ ID NO: 19) and S'-CGCCCAAGCTTTTCCTnT-S' (SEQ ID NO: 20)) specific to 
10 the CTBPl gene (C-terminal binding protein 1) were used to serve as control primers. This 
approach provides the relative expression as measured by cycle threshold (Ct) value of the 
target mRNA relative to an amount of CTBPl Ct value. The sample panel included total 
RNA pairs of normal and tumor tissues from colon, kidney, ovary, pancreas, and stomach 
(Ambion. Inc.. Austin. TX). The Q-RT-PCR data confirms the down-regulation of LFG3 m 
1 5 cancer compared to normal samples. 

r^f.^^.. ^. r..nin ^ nf Fun- l.nPth Huma n cDNA q Fm) Corresponding to 
rufferantiallv ExP "'>Q«8ftd mRNA Species 

n,e fiilUength cDNA having SEQ ID NO: 7 was obtained by the oligo-pulling 
method using the GeneTrapper assay (Life Technologies. Rockville. MD). Briefly, a gene- 
20 specific oligo (5'-GTATGCATCAGAATTCCCTATAGATCTrr.3' (SEQ ID NO: 44)) 
was designed based on the sequence of the EST containing 46104_at sequence. The oligo 
was labeled with biotin and used to hybridize with 5 jig of single strand piasmid DNA 
(cDNA recombinants) from human fetal kidney (ResGen. Huntsville, AL) foUowmg the 
procedures of Sambrook et al. The hybridized cDNAs were separated by streptavidm- 
25 conjugated beads and eluted by heating. The eluted cDNA was converted to double strand 
piasmid DNA and used to transform E. coli cells (DHIOB) and the longest cDNA was 
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screened. After positive selection was confirmed by PCR using gene-specific primers, the 
cDNA clone was subjected to DNA sequencing. The 5'-end of LFG3 was identified by 
rapid amplification of cDNA ends (RACE) using the cDNA prepared from human fetal 
kidney (Clontech, Palo Alto, CA) and a gene specific primer (5'- 
5 TTCCTTCACCAAAGGCATCCAGCCATTCTATG-3' (SEQ ID NO: 46)). 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NO: 1. The cDNA 
comprises 3162 base pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 7, at 
10 nucleotides 40S-183S (405-1838 including the stop codon), encodes a protein of 477 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 7 is set forth in SEQ ID NO: 8. 

SEQ ID NO: 8 is similar to monocarboxylate transporters (MCTs) and contains ten 
predicted transmembrane domains (amino acids positions 10-29, 80-99, 107-128, 140-160, 
15 274-295, 312-332, 339-360, 363-384, 396-416, and 433-451). MCT proteins catalyze the 
facilitated transport of monocarboxylates such as lactate, pyruvate, branched-chain oxo 
acids, ketone bodies, beta-hydroxy-butylate, and acetate (Halestrap and Price (1999), 
Biochem. J. 343:281-299). Table 4 summarizes the similarity ratios of SEQ ID NO: 4 with 
the eight known monocarboxylate transporters. 

20 



TABLE 4 . Homology of LFG3 with MCT proteins 



Protein 


Size (amino acids) 


Identity (%) 


Positives (%) 


MCT1 


500 


17.5 


34.3 


MCT2 


478 


19.5 


35.5 


MCT3 


504 


19.5 


34.1 


MCT4 


465 


19.0 


33.2 
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MCT5 


487 


22.1 


36.9 


MCT6 


505 


16.4 


31.5 


MCT7 


523 


20.1 


35.2 


MCT8 


613 


15.9 


27.9 



Figure S shows the results of a hydrophobicity analysis of the amino acid sequence 
of SEQ ID NO: 8 using Kyte-Doolittle values (Kyte and Doolittle (1982), J. Mol Biol 
157:103-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
S above. 

Analysis by Northern blot was performed to determine the size of the mRNA transcripts that 
correspond to LFG3. A Northern blot containing total RNAs from various human tissues 
was used (Human 12-Lane MTN Blot, Clontech, Palo Alto, CA), and the EST containing 
46104_at sequence was radioactively labeled by the random primer method and used to 
10 probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0.1% SDS, 5X 
Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42^C and washed with 0.2X SSC 
containing 0.1% SDS at room temperature. The Northern blot showed a single transcript for 
this gene, which is approximately 4.2 kb in size. This corresponds to the size of the LFG3 
clone (SEQ ID NO: 7). 

15 EXAMPLE 7: Identification of Differcntiallv Expressed mRNA in Cancers - 4 

The process in EXAMPLE 1 was repeated except that the marker LFG4 was used 
instead of the marker LFGl . 

Analysis of the chip data showed that the expression of the marker LFG4 was 
significantly down-regulated in cancer tissue samples compared to samples from normal 
20 tissue. The expression level of LFG4 (SEQ ID NO: 9) can be measured by chip sequence 
fragment no. 62158_at on Affymetrix GeneChips® U95, The 622158_at sequence is 
derived from the EST All 23532. The expression levels of 62158_at in various malignant 
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neoplasms, compared to normal control tissues, are shown in Table 5, where the fold- 
change, the direction of the change (up- or down-regulation), p-value are also indicated. The 
fold change (cancerous/normal) was calculated by comparing the geometric mean of 
average difference in a cancerous sample set against the geometric mean of average 
difference in the normal tissue sample set. A fold-change greater than 1.5 was considered to 
be significant (Wodicka et al. (1997), Nature Biotech. 15:1359-1367). Also indicated in the 
Table 5 are. for each tissue type, the numbers of samples that are called present, absent, or 
marginal together with the total number of samples in that sample set. These data indicate 
that down-regulation of LFG4 may be diagnostic for cancer. 
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The GeneChip expression results, determined by sample binding to chip sequence 
fragment no. 62158 at. were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Peridn-Elmer). PCR pr-ers (5- 

AAATGTCTGATrACCCCATITrATCAGT.3' (SEQ ID NO: 47) and 5- 
1 TAATCCTGAAATGAACAGCTAACA-3') (SEQ ID NO: 48) designed based on the 
sequence information file of the EST containing the Affymetrix fragment (62158_at) were 
used in the assay. The target gene in each RNA sample (10 ng of total RNA) was assayed 
relative to a reference gene. For this purpose, primers (5'- 

GTmTCCTAATnTGGCATGAAC-y (SEQ ID NO: 19) and 5- 
0 CGCCCAAGCTTTTCCTTTT-S' (SEQ ID NO: 20)) specific to the CTBPl gene (C- 
tem^inal binding protein 1) were used to serve as control primers. TMs approach provides 
the relative expression as measured by cycle threshold (Ct) value of the target mRNA 
relative to an amount of CTBPl Ct value. The sample panel included total RNA pairs of 
nom^al and tumor tissues from colon, liver, lung, ovary, pancreas, and stomach (Amb.on, 
15 Inc., Austin. TX). The Q-RT-PCR data confirms the down-regulation of LFG4 m cancer 
compared to normal samples. 

n...;r. , of Fnll-Le nrnh Human cDNA (T Fn4) Conesponding to 
rnffi.rAntiall v Express ^H mRNA Soecies 

The foil-length cDNA having SEQ ID NO: 9 was obtained by rapid amplification of 
20 cDNA ends (RACE). Briefly. gene-specific oligos (5'- 
TAATGrrAGAGTAACAGCAnTTCCrTCAA-3' (SEQ ID NO: 49) and 5'- 
TGCCCCACACTAACTCAGTTCTTGTGATG-3' (SEQ ID NO: 50)) were designed based 
on the sequence of the EST containing 62158_at sequence. The oligos was used for PCR 
amplification of the cDNAs prepared from human brain (Clontech. Palo Alto. CA). The 
25 amplified products with the primers were incorporated into PCR4.Topo vector using Topo 
Cloning System (Invitrogen. Carlsbad. CA). and followed by sequencing. 
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The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NO: 9. "me cDNA 
comprises 4891 base pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 9. at 
5 nucleotides 89-1 150 (89-1 153 including the stop codon). encodes a protein of 354 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 9 is set forth in SEQ ID NO: 10. 

SEQ ID NO: 10 is similar to rat Kilon and chicken Neuiotractin (Funatsu et al. 
(1999), J Biol Chem 274:8224-8230; Marg et al (1999). J Cell Biol 145:865-876). Protein 
10 sequence analysis reveals a secretory signal peptide (amino acid positions 1-33). three 
immunoglobulin domains (amino acid positions 47-136. 145-208, and 231-312). and six 
putative A^-Unked glycosylation sites (amino acid positions 73. 155. 275. 286. 294. and 307). 
Kilonmeurotractin is a member of IgLON subfamily of the immunoglobulin superfemily. 
IgLONs are a family of glycosylphosphatidylinositol (GPD-Hnked cell adhesion molecules 
which are thought to modify neurite outgrowth and might play a role in cell-cell adhesion 
and recognition (Miyate et al. (2000), J Comparative Neurol 424:74-85). 

Figure 6 shows the results of a hydrophobicity analysis of the amino acid sequence 
of SEQ ID NO: 10 using Kyte-Doolittle values (Kyte and Doolittle (1982), J. Mol. Biol. 
157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
above. This hydropathy plot shows the presence of hydrophobic region at the C-terminus. In 
case of GPI-anchored proteins, the addition of the GPI anchor is known to occur after the 
cleavage of the C-term.nal hydrophobic region. A putative GPI anchor attachment site was 
found (Gly at the amino acid position 324). 

Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG4. A Northern blot containing total RNAs from various 
human tissues was used (Human 12-Lane MTN Blot, Clontech. Palo Alto, CA). and the 
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EST containing 62158_at sequence was radioactively labeled by the random primer method 
and used to probe the blot. The blot was hybridized in 50% formamide. 5X SSPE, 0.1% 
SDS. 5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42°C and washed with 
0.2X SSC containing 0.1% SDS at room temperature. The Northern blot showed a single 
transcript for this gene, which is approximately 5.4 kb in size. This corresponds to the size 
of the LFG4 clone (SEQ ID NO: 9). 

F.XAMPLE 9- Identification of Differenti ally Rxpressed mRNA in Cancers - ? 

The process in EXAMPLE 1 was repeated except that the marker LFG5 was used 
instead of the marker LFGl. 

Analysis of the chip data showed that the expression of the marker LFG5 was 
significantly down-regulated in cancer tissue samples compared to samples from normal 
tissue. The expression level of LFG5 (SEQ ID NO: 1 1) can be measured by chip sequence 
fragment no. 46659_at on Affymetrix GeneChips® U95. The expression levels of 46659_at 
in various malignant neoplasms, compared to normal control tissues, are shown in Table 6, 
where the fold-change, the direction of the change (up- or down-regulation), p-value are 
also indicated. The fold change (cancerous/normal) was calculated by comparing the 
geometric mean of average difference in a cancerous sample set against the geometric mean 
of average difference in the normal tissue sample set. Also indicated in the Table 6 are, for 
each tissue type, the numbers of samples that are called present, absent, or marginal together 
with the total number of samples in that sample set. These data indicate that differential 
regulation of LFG5 may be diagnostic for cancer. 
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Tha GeneChip expression results, determined by sample binding to chip sequence 
fragment no. 46659_at. were validated by quantitative RT-PCR (Q-RT-PCR) using the 
Taqman® assay (Perkin-Elmer). PCR primers (5'- 

AAGGCnTATCAGGTCTGCATATAGAATC-3' (SEQ ID NO: 51) and 5'- 
5 GCAAAGAACCCTAATGCTATTTATCAGC-3' (SEQ ID NO: 52)) designed based on the 
sequence information file of the specific Affymetrix fragment (46659_at) were used in the 
assay. The target gene in each RNA sample (10 ng of total RNA) was assayed relative to a 
reference gene. For this puipose. primers (5'^rmTCCTAATTTTGGCATGAAC-3' 
(SEQ ID NO: 19) and S'-CGCCCAAGCTTTTCCTTTT-S' (SEQ ID NO: 20)) specific to 
10 the CTBPl gene (C-terminal binding protein 1) were used to serve as control primers. This 
approach provides the relative expression as measured by cycle threshold (Ct) value of the 
target mRNA relative to an amount of CTBPl Ct value. The sample panel included total 
RNA pairs of normal and tumor tissues from kidney, lung, ovary, and pancreas (Ambion, 
Inc., Austin, TX). The Q-RT-PCR data confirms the differential regulation of LFG5 in 
1 5 cancer compared to normal samples. 

PVAMPT.R 10: Clop in p of Full-I pn pth Human cONA (LFG5> Corresponding to 
niffftrentiallv Exp ressed mRNA Species 

The fiill-length cDNA having SEQ ID NO: 11 was obtained by the oligo-pulling 
method using the GeneTrapper assay (Life Technologies, Rockville. MD). Briefly, a gene- 
20 specific oligo (5'-GAGAAGACCAGGGAAGAAGCAG-3' (SEQ ID NO: 53)) was 
designed based on the sequence of an EST containing 46659_at sequence. The oligo was 
labeled with biotin and used to hybridize with 5 jig of single strand plasmid DNA (cDNA 
recombinants) from a human heart library (ResGen, Huntsville. AL) following the 
procedures of Sambrook et al The hybridized cDNAs were separated by streptavidin- 
25 conjugated beads and eluted by heating. The eluted cDNA was converted to double strand 
plasmid DNA and used to transform E. coli cells (DHIOB) and the longest cDNA was 
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screened. After positive selection was confirmed by PGR using gene-specific primers, the 
cDNA clone was subjected to DNA sequencing. 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NO: 1 1. The cDNA 
S comprises 3098 base pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 1 1, at 
nucleotides 223-1569 (223-1572 including the stop codon), encodes a protein of 449 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 1 1 is set forth in SEQ ID NO: 12, 

10 SEQ ID NO: 12 contains a thymidylate kinase domain (amino acid positions 257- 

438). Thymidylate kinase is a member of nucleotide monophosphate kinases (NMPKs) 
which play roles in the nucleotide synthesis for RNA and DNA synthesis and are required 
for the pharmacological activation of therapeutic nuicleoside and nucleotide analogs (Van 
Rompaye/a/. (2000), Pharmacology & Therapeutics %1M9A9«). SEQ ID NO: 12 exhibits 

15 homology to a mouse thymidylate kinase (GenBank Accession No. NM_020557) which is 
induced during macrophage activation (Lee and O'Brien (1995), J Immunol. 154:6094- 
6102). It shows 63% identity over the entire contiguous sequence. 

Figure 7 shows the results of a hydrophobicity analysis of the amino acid sequence 
of SEQ ID NO: 12 using Kyte-Doolittle values (Kyte and Doolittle (1982), J. Mol Biol. 
20 157:105-142). Hydrophilic regions may be used to produce antigenic peptides, as described 
above. 

Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG5. A Northern blot containing total RNAs from various 
human tissues was used (Human MTN Blot, Clontech, Palo Alto, CA), and an EST 
25 containing 8294 l_at sequence was radioactively labeled by the random primer method and 
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used to probe the blot. The blot was hybridized in 50% formamide, 5X SSPE, 0.1% SDS, 
5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at 42*^0 and washed with 0.2X 
SSC containing 0.1% SDS at room temperature. The Northern blot showed a single 
transcript for this gene, which is approximately 3.0 kb in size. This corresponds to the size 
5 of the LFG5 clone (SEQ ID NO: 1 1). 

EXAMPLE 1 1 : Identification of Differentially Expressed mRNA in Cancers - 6 

The process in EXAMPLE 1 was repeated except that the marker LFG6 was used 
instead of the marker LFG 1 . 

Analysis of the chip data showed that the expression of the marker LFG6 was 
10 significantly up-regulated in cancer tissue samples compared to samples from normal tissue. 
The expression level of LFG6 (SEQ ID NO: 13 or 15) can be measured by chip sequence 
fragment no. 44103_at on Affymetrix GeneChips® U95. The 44103__at sequence is derived 
from the EST AA865614. The expression levels of 44103_at in various malignant 
neoplasms, compared to normal control tissues, are shown in Table 7, where the fold- 
15 change, the direction of the change (up- or down-regulation), p-value are also indicated. The 
fold change (cancerous/normal) was calculated by comparing the geometric mean of 
average difference in a cancerous sample set against the geometric mean of average 
difference in the normal tissue sample set. A fold change greater than 1.5 was considered to 
be significant (Wodicka et al (1997), Nature Biotech, 15:1359-1367). Also indicated in the 
20 Table 7 are, for each tissue type, the numbers of samples that are called present, absent, or 
marginal together with the total number of samples in that sample set. These data indicate 
that up-regulation of LFG6 may be diagnostic for cancer. 
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10 



The CneChip expression .esults, dettmJneJ by sample bindiag to chip «<,uence 

tZ™® »s.y (Peridn-Elmer). PCR prim«s (5-.GOACOGOOAACTrGOACGC-3 
™ nO. 54) lnd 5-.AAGTGCAGGGCCrCTGGGTG.3. (SEQ ID NO: 55)) des,gne<> 
Z on *e se, eno. inft,nna.io„ flle of *e s,.oi«c A«yn,e«x ftagn,en. (441«3.a.) were 
i::^in .be Jy. ^.e gene in each RNA »«ple (.0 ng of ««a, BMA) was as^yed 
„U.ive .o a -efc-ce gene. For to P"^ -'"^ 
GTrrrrCCTAATrTrGGCATGAAC.3- (SEQ ID NO: 19) and ' 
COCCCAAOCrmCCmr-J- (SEQ n, no: iO,) specifle «. *e CTBP. gen. ^ 
.em,i„al binding pro«U. 1) were n«a . serve a, con«ol pHnrers ^^ '^'^-J^ 
d,e relative expression as measured by eycle ftreshold (O) value of *e »rg« «.RNA 
reroanlunlofCraPl av.ue. -''^ ^^'^-^'^-^ 

nom,al and wmor .issues fton> liver and ovary (Ambion, Ine., Austrn. TX). The Q-RT PCR 
dau. confirms ft. up-regulation of LFG6 in cancer comp^ed «. normal sampl«. 



IS 




20 



25 



The «.ll-l«.g* CDNA having SEQ ID NO: 13 or 15 was ob«in.d by .he o„go- 
puUing mrth«. using <h. Oen.Trapp« assay (Lif. T»l».ologi«. Rockville, ^D)' B-Ay. 

gelspeciflc oligo C.-CCCTOGGTCATCGGACGCT-. (SBQ ID NO: » v^ 
aesl,«. based on ft. sequence of an EST conuiniag 44103_a, 
1,^::, wift bio.in and us«l . KybHdi« «ift S .g of single s«and ^^'^^^J^^^ 
^mbinanu) ftom a Miy di».».«a«d buman sumach •-"ocarcmom' 
Hun«vi,.e. AL, following ft. procedures of S^nbrook « ol. m hyb„d.»d DNAs wer. 
Z-ed y s rep.vidin^njuga«d b«ds and elu«d b, healing. ^ '^^^l 
Tnver^d » double s«nd pUsmid DNA ».d us«l «. Uansfonn E. coU ells (DH.OB) »^ 
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the longest cDNA was screened. After positive selection was confirmed by PCR using 
gene-specific primers, the cDNA clone was subjected to DNA sequencing. 

The nucleotide sequence of the full-length human cDNAs corresponding to the 
differentially regulated mRNA detected above is set forth in SEQ ID NOS: 13 and 15. In the 
former, the cDNA comprises 1893 base pairs. In the latter, the cDNA comprises 1597 base 
pairs. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 13, at 
nucleotides 418-1392 (418-1395 including the stop codon). encodes a protein of 325 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 13 is set forth in SEQ ID NO: 14. Figure 9 shows the results of a hydrophobicity 
analysis of the amino acid sequence of SEQ ID NO: 14 using Kyte-Doolittle values (Kyte 
and Doolittle (1982). J. Mol Biol 157:105-142). Hydrophilic regions may be used to 
produce antigenic peptides, as described above. 

An open reading frame within the cDNA nucleotide sequence of SEQ ID NO: 15, at 
15 nucleotides 271-1431 (271-1434 including the stop codon), encodes a protein of 387 amino 
acids. The amino acid sequence corresponding to a predicted protein encoded by SEQ ID 
NO: 15 is set forth in SEQ ID NO: 16. Figure 10 shows the results of a hydrophobicity 
analysis of the amino acid sequence of SEQ ID NO: 16 using Kyte-Doolittle values (Kyte 
and Doolittle (1982), J. Mol Biol 157:105-142). Hydrophilic regions may be used to 
20 produce antigenic peptides, as described above. 

SEQ ID NOS: 14 and 16 contain ubiquitin homologues (UBQ) domain (amino acid 
positions 239-300). SEQ ID NOS: 14 and 16 are similar to rat Sharpin protein (Lim et al 
(2001). Mol Cell Neurosci 17:385-397). Sharpin directly interacts with the ankyrin repeats 
of Shank protein which functions in the organization of cytoskeletal complexes and 
25 intracellular signaling at specialized cell junctions (Sheng and Kim (2000). J Cell Sci 
113:1851-1856). 
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Analysis by Northern blot was performed to determine the size of the mRNA 
transcripts that correspond to LFG6. A Northern blot containing total RNAs from various 
human tissues was used (Human 12-Lane MTN Blot. Clontech. Palo Alto. CA). and an EST 
containing 44103_at sequence was radioactively labeled by the random primer method and 
used to probe the blot. The blot was hybridized in 50% formamide. 5X SSPE. 0.1% SDS, 
5X Denhart's solution, and 0.2 mg/ml herring sperm DNA at Al^C and washed with 02X 
SSC containing 0.1% SDS at room temperature. The Nortfiem blot showed three transcripts 
for this gene, which are approximately 2.2 kb, 1.5 kb, and 1.2 kb in size. This corresponds 
to the sizes of the LFG6 clones (SEQ ID NO: 13 and 15). 

Although the present invention has been described in detail with reference to 
examples above, it is understood that various modifications can be made without departing 
from the spirit of the invention. Accordingly, the invention is limited only by the following 
claims. All cited patents, patent applications and publications referred to in this application 
are herein incorporated by reference in their entirety. 
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WHAT IS CLAIMED IS; 

1. An isolated nucleic acid molecule selected from the group consisting of: (a) 
an isolated nucleic acid molecule comprising SEQ ID NO: 1, 3, 5, 7, 9, 1 1, 13 or IS, (b) an 
isolated nucleic acid molecule encoding SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, (c) an 

S isolated nucleic acid molecule that encodes a protein that is expressed in cancer and that 
exhibits at least about 75% nucleotide sequence identity over the entire contiguous sequence 
of SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 or 15, and (d) an isolated nucleic acid molecule 
comprising the complement of a nucleic acid molecule of (a), (b) or (c). 

2. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid 
10 molecule comprises nucleotides 390-4880 of SEQ ID NO: 1, nucleotides 12-4904 of SEQ 

ID NO: 3, nucleotides 424-1908 of SEQ ID NO: 5, nucleotides 405-1 835 of SEQ ID NO: 7, 
nucleotides 89-1150 of SEQ ID NO: 9, nucleotides 223-1569 of SEQ ID NO: 11, 
nucleotides 418-1392 of SEQ ID NO: 13, or nucleotides 271-1431 of SEQ ID NO: 15. 

3. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid 
15 molecule comprises nucleotides 390-4883 of SEQ ID NO: 1, nucleotides 12-4907 of SEQ 

ID NO: 3, nucleotides 424-191 1 of SEQ ID NO: 5, nucleotides 405-1838 of SEQ ID NO: 7, 
nucleotides 89-1153 of SEQ ID NO: 9, nucleotides 223-1572 of SEQ ID NO: 11, 
nucleotides 418-1395 of SEQ ID NO: 13, or nucleotides 271-1434 of SEQ ID NO: 15. 

4. The isolated nucleic acid molecule of claim 1, wherein the nucleic acid 
20 molecule consists of nucleotides 390-4883 of SEQ ID NO: 1, nucleotides 12-4907 of SEQ 

ID NO: 3, nucleotides 424-1908 of SEQ ID NO: 5, nucleotides 405-1835 of SEQ ID NO: 7, 
nucleotides 89-1153 of SEQ ID NO: 9, nucleotides 223-1569 of SEQ ID NO: 11, 
nucleotides 418-1395 of SEQ ID NO: 13, or nucleotides 271-1434 of SEQ ID NO: 15. 

5. The isolated nucleic acid molecule of any one of claims 1-4, wherein said 
25 nucleic acid molecule is operably linked to one or more expression control elements. 
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6. A vector comprising an isolated nucleic acid molecule of any one of claims 

1-4. 

7. A host cell transformed to contain the nucleic acid molecule of any one of 
claims 1-4. 

S 8. A host cell comprising a vector of claim 6. 

9. A host cell of claim 8, wherein said host cell is selected from the group 
consisting of prokaryotic host cells and eukaryotic host cells. 

10. A method for producing a polypeptide comprising culturing a host cell 
transformed with the nucleic acid molecule of any one of claims 1-4 under conditions in 

10 which the protein encoded by said nucleic acid molecule is expressed. 

11. The method of claim 10, wherein said host cell is selected from the group 
consisting of prokaryotic host cells and eukaryotic host cells. 

12. An isolated polypeptide produced by the method of claim 10. 

13. An isolated polypeptide or protein selected from the group consisting of a 
15 protein comprising the amino acid sequence of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16, and 

a protein having at least about 75% amino acid sequence identity with SEQ ID NO: 2, 4, 6, 
8. 10, 12, 14 or 16. 

14. An isolated antibody or antigen-binding antibody fragment that binds to a 
polypeptide of claim 13. 

20 IS. An antibody of claim 14 wherein said antibody is a monoclonal or a 

polyclonal antibody. 

16. A method of identifying an agent which modulates the expression of a 
nucleic acid encoding a protein of claim 13, comprising: 
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exposing cells which express the nucleic acid to the agent; and 

determining whether the agent modulates expression of said nucleic acid, 
thereby identifying an agent which modulates the expression of a nucleic acid 
encoding the protein. 

5 1 7. A method of identi^ing an agent which modulates the level of or at least one 

activity of a protein of claim 13, comprising: 

exposing cells which express the protein to the agent; 

determining whether the agent modulates the level of or at least one activity 
of said protein, thereby identifying an agent which modulates the level of or at least 
1 0 one activity of the protein, 

18. The method of claim 17, wherein the agent modulates one activity of the 

protein. 

19. A method of modulating the expression of a nucleic acid encoding a protein 
of claim 13, comprising: 

15 administering an effective amount of an agent which modulates the 

expression of a nucleic acid encoding the protein. 

20. A method of modulating at least one activity of a protein of claim 13, 
comprising: 

administering an effective amount of an agent which modulates at least one 
20 activity of the protein. 

21. A method of identifying binding parmers for a protein of claim 13, 
comprising: 

exposing said protein to a potential binding partner; and 
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determining if the potential binding partner binds to said protein, thereby 
identifying binding partners for the protein. 

22. A method of identifying an agent which modulates the uiteraction between a 
binding partner of claim 21 and a protein of claim 13, comprising: 

exposing said protein with said partner to the agent; and 

determining whether the agent modulates association of the binding partner 
with said protein, thereby identifying an agent which modulates association of a 
binding partner with said protein. 

23. A method of modulating the interaction between a binding partner of claim 
21 and a protein of claim 13, comprising: 

administering an effective amount of an agent which modulates association 
of a binding partner with said protein. 

24. A non-human transgenic animal modified to contain a nucleic acid molecule 
of any of claims 1-4. 

25. The transgenic animal of claim 24, wherein the nucleic acid molecule 
contains a mutation that prevents expression of the encoded protein. 

26. A method of treating a disease state in a subject, comprising: 

inserting into a diseased cell a gene construct comprising an isolated nucleic 
acid molecule of any one of claims 1-4 linked to a promoter or enhancer element 
such that expression of said nucleic molecule causes suppression of said disease. 

27. The method of claim 26, wherein said inserting into a diseased cell is 
accomplished in vivo. 



80 



wo 2004/035789 ^^CT/KR2003/002161 



28. The method of claim 26, wherein said inserting into a diseased cell further 
comprises use of a viral or plasmid agent and is accomplished either in vitro or in vivo. 

29. A method of diagnosing a disease state in a subject, comprising: 

determining the level of expression of a nucleic acid molecule or protein of 
5 any one of claims 1-4 or 13. 

30. The method of claims 26 and 29, wherein the disease state is cancer. 

31. The method of claims 26 and 29, wherein the disease state is a malignant 
neoplasm. 

32. The method of claim 31, wherein the malignant neoplasm occurs in the 
10 breast, colon, esophagus, kidney, liver, lung, lymph node, ovary, pancreas, prostate, rectum, 

and/or stomach. 

33. A composition comprising a diluent and a polypeptide or protein selected 
ftom the group consisting of: an isolated polypeptide comprising the amino acid sequence 
of SEQ ID NO: 2. 4, 6. 8, 10, 12, 14 or 16; an isolated polypeptide comprising a fragment of 

1 5 at least 10 amino acids of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 16; an isolated polypeptide 
comprising conservative amino acid substitutions of SEQ ID NO: 2, 4, 6, 8, 10, 12, 14 or 
16;an isolated polypeptide comprising naturally occurring amino acid sequence variants of 
SEQ ID NO: 2, 4, 6, 8, 10, 12. 14 or 16; and an isolated polypeptide exhibiting at least 
about 75% amino acid sequence identity with SEQ ID NO:2, 4, 6, 8, 10, 12, 14 or 16. 

20 
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DRAWINGS 
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FIG. 2 
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FIG. 4 
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FIG. 5 
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Sequence Listing 



<110> LG Life Sciences Ltd. 

<120> Gene Famililies Associated With Cancers 

<130> LL03PCT006 

<150> US60/4199XX 

<151> 2002-10-18 

<150> US60/419912 

<151> 2002-10-18 

<150> US60/420088 

<151> 2002-10-18 

<150> US60/434243 

<151> 2002-12-16 

<150> US60/434278 

<151> 2002-12-16 

<150> US60/438278 

<151> 2003-01-03 

<160> 56 

<170> Kopatentin 1.71 

<210> 1 

<211> 5293 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (390) . . (4880) 

<223> LBFL109 Clone A 
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gtcctgtctg gcggtgccga cggtgagggg cggtggccca acggcgggag attcaaacct 60 

ggaagaagga ggaacatgga gaggagagca gcgggcccag gctgggcagc ctctggatcg 120 

aggcctgcct gaaggaggag cttccttccc cggtggagct ggaggagagc cttcggaatg 180 

gagtgctgct ggccaagctg ggccactgtt ttgcaccctc cgtggttccg ttgaagaaga 240 

tctacgatgt ggagcagctg cggtaccagg caactggctt acatttccgt cacacagaca 300 

acatcaactt ttggctatct gcaatagccc acatcggtct gccttcgacc ttcttcccag 360 

agaccacgga catctatgac aaaaagaac atg ccc egg gta gtc tac tgc ate 413 

Met Pro Arg Val Val Tyr Cys lie 
1 5 

cat get cte agt etc ttc etc ttc egg ctg gga ttg gee cct cag ata 4 61 

His Ala Leu Ser Leu Phe Leu Phe Arg Leu Gly Leu Ala Pro Gin He 
10 15 20 

cat gat eta tac ggg aaa gtg aaa ttc aca get gag gaa etc age aac 509 
His Asp Leu Tyr Gly Lys Val Lys Phe Thr Ala Glu Glu Leu Ser Asn 
25 30 35 40 

atg gcg tec gaa ctg gcc aaa tat ggc etc cag ctg cct gcc ttc age 557 
Met Ala Ser Glu Leu Ala Lys Tyr Gly Leu Gin Leu Pro Ala Phe Ser 
45 50 55 

aag ate ggg ggc ate ttg gee aat gag etc teg gtg gat gag get gca 605 
Lys He Gly Gly He Leu Ala Asn Glu Leu Ser Val Asp Glu Ala Ala 
60 65 70 

gtc cat gca get gtt ctt gee ate aat gaa gca gtg gag cga ggg gtg 653 
Val His Ala Ala Val Leu Ala He Asn Glu Ala Val Glu Arg Gly Val 
75 80 85 

gtg gag gac ace ctg get gcc ttg cag aat eec agt get ctt ctg gag 701 
Val Glu Asp Thr Leu Ala Ala Leu Gin Asn Pro Ser Ala Leu Leu Glu 
90 95 100 
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aat etc cga gag cct ctg gca gcc gtc tac cag gag atg ctg gcc cag 
Asn Leu Arg Glu Pro Leu Ala Ala Val Tyr Gin Glu Met Leu Ala Gin 



105 



110 lis 120 



gcc aag atg gag aag gca gcc aat gcc agg aac cat gat gac aga gaa 
Ala Lys Met Glu Lys Ala Ala Asn Ala Arg Asn His Asp Asp Arg Glu 
125 130 

age cag gac ate tat gac cac tac eta act cag get gaa ate cag gge 
Ser Gin Asp He Tyr Asp His Tyr Leu Thr Gin Ala Glu He Gin Gly 
140 1« 150 

aat ate aac cat gtc aac gtc cat ggg get eta gaa gtt gtt gat gat 
Asn lie Asn His Val Asn val His Gly Ala Leu Glu Val Val Asp Asp 
155 160 

gcc ctg gaa aga cag age cct gaa gcc ttg etc aag gcc ctt caa gac 
Ala Leu Glu Arg Gin Ser Pro Glu Ala Leu Leu Lys Ala Leu Gin Asp 
170 "5 180 

cct gcc ctg gcc ctg cga ggg gtg agg aga gac ttt get gac tgg tac 
pro Ala Leu Ala Leu Arg Gly Val Arg Arg Asp Phe Ala Asp Trp Tyr 



185 



190 IW 200 



ctg gag cag ctg aac tea gac aga gag cag aag gca cag gag ctg gge 
Leu Glu Gin Leu Asn Ser Asp Arg Glu Gin Lys Ala Gin Glu Leu Gly 
205 210 215 

ctg gtg gag ctt ctg gaa aag gag gaa gtc cag get ggt gtg get gca 
Leu val Glu Leu Leu Glu Lys Glu Glu Val Gin Ala Gly Val Ala Ala 
220 225 230 

gcc aac aca aag ggt gat cag gaa caa gcc atg etc cac get gtg cag 
Ala Asn Thr Lys Gly Asp Gin Glu Gin Ala Met Leu His Ala Val Gin 
235 240 245 

egg ate aac aaa gcc ate egg agg gga gtg geg get gac act gtg aag 
Arg He Asn Lys Ala He Arg Arg Gly Val Ala Ala Asp Thr Val Lys 
250 255 260 



749 



797 



845 



893 



941 



989 



1037 



1085 



1133 



1181 
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gag ctg atg tgc cct gag gcc cag ctg cct cca gtg tac cct gtt gca 
Glu Leu Met Cys Pro Glu Ala Gin Leu Pro Pro Val Tyr Pro Val Ala 
265 270 215 280 

teg tct atg tac cag ctg gag ctg gca gtg etc cag cag cag cag ggg 
Ser ser Met Tyr Gin Leu Glu Leu Ala val Leu Gin Gin Gin Gin Gly 
285 290 295 

gag ctt ggc cag gag gag etc ttc gtg get gtg gag atg etc tea get 
Glu Leu Gly Gin Glu Glu Leu Phe val Ala val Glu Met Leu Ser Ala 
300 305 310 

gtg gtc ctg att aac egg gcc ctg gag gcc egg gat gee agt ggc ttc 
Val val Leu lie Aan Arg Ala Leu Glu Ala Arg Asp Ala Ser Gly Phe 
315 320 325 

tgg age age ctg gtg aac cct gcc aca ggc ctg get gag gtg gaa gga 
Trp ser Ser Leu Val Asn Pro Ala Thr Gly Leu Ala Glu val Glu Gly 
330 335 340 

gaa aat gcc cag cgt tac ttc gat gcc ctg ctg aaa ttg cga cag gag 
Glu Asn Ala Gin Arg Tyr Phe Asp Ala Leu Leu Lys Leu Arg Gin Glu 
345 350 355 360 

cgt ggg atg ggt gag gac ttc ctg age tgg aat gac ctg cag gee acc 
Arg Gly Met Gly Glu Asp Phe Leu Ser Trp Asn Asp Leu Gin Ala Thr 
365 370 375 

gtg age cag gtc aat gca eag ace cag gaa gag act gac egg gtc ctt 
val Ser Gin Val Asn Ala Gin Thr Gin Glu Glu Thr Asp Arg Val Leu 
380 385 390 

gca gtc age etc ate aat gag get ctg gae aaa ggc age cct gag aag 
Ala val ser Leu lie Asn Glu Ala Leu Asp Lys Gly Ser Pro Glu Lys 
395 400 

act ctg tct gcc eta ctg ctt cet gea get ggc eta gat gat gtc age 
Thr Leu Ser Ala Leu Leu Leu Pro Ala Ala Gly Leu Asp Asp Val Ser 
410 "5 420 



1229 



1277 



1325 



1373 



1421 



1469 



1517 



1565 



1613 



1661 
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etc cct gtc gcc cct egg tac cat etc etc ctt gtg gca gee aaa agg 
Leu pro val iUa Pro Arg Tyr His Leu Leu Leu Val Ala Ala Lys Arg 
425 



430 



435 



440 



cag aag gcc cag gtg aca ggg gat cct gga get gtg ctg tgg ctt gag 
Gin Lys Ala Gin Val Thr Gly Asp Pro Gly Ala Val Leu Trp Leu Gl« 
44S 



450 «5 



gag ate cgc cag gga gtg gtc aga gcc aac cag gac act aat aca get 
Glo lie Arg Gin Gly val Val Arg Ala Asn Gin Asp Thr Asn Thr Ala 
460 



465 



cag aga atg get ctt ggt gtg get gee ate aat eaa gee ate aag gag 
Gin Arg Met Ala Leu Gly val Ala Ala He Asn Gin Ala He Lys Glu 



475 



480 ^85 



ggc aag gca gcc cag act gag egg gtg ttg agg aac ecc gca gtg gcc 
Gly Lys Ala Ala Gin Thr Glu Arg Val Leu Arg Asn Pro Ala Val Ala 

495 500 



490 



ett cga ggg gta gtt cce gac tgt gee aac gge tae cag ega gee ctg 
Leu Arg Gly Val Val Pro Asp Cys Ala Asn Gly Tyr Gin Arg Ala Leu 
505 

gaa agt gee atg gca aag aaa eag cgt cca gca gae aca get tte tgg 
Glu ser Ala Met Ala Lys Lys Gin Arg Pro Ala Asp Thr Ala Phe Trp 
525 "° 

gtt caa cat gac atg aag gat ggc act gee tae tac tte eat ctg eag 
val Gin His ASP Met Lys Asp Gly Thr Ala Tyr Tyr Phe His Leu Gin 
540 5« "° 



ace 



tte cag ggg ate tgg gag eaa cct cct gge tgc ecc etc aac ace 
Thr Phe Gin Gly He Trp Glu Gin Pro Pro Gly Cys Pro Leu Asn Thr 
555 



560 565 



tct cae ctg ace egg gag gag ate eag tea get gtc acc aag gtc act 
ser His Leu Thr Arg Glu Glu He Gin Ser Ala Val Thr Lys Val Thr 
570 575 580 



1709 



1757 



1805 



1853 



1901 



1949 



1997 



2045 



2093 



2141 
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get gcc tat gac cgc caa cag etc tgg aaa gee aac gtc ggc ttt gtt 
Ala AXa Tyr Asp Arg Gin Gin Leu Trp Lys Ala Asn Val Gly Phe Val 
585 590 595 600 

ate cag etc cag gcc cgc etc cgt ggc ttc eta gtt egg cag aag ttt 
lie Gin Leu Gin Ala Arg Leu Arg Gly Phe Leu Val Arg Gin Lys Phe 
605 610 615 

get gag cat tec cac ttt ctg agg acc tgg etc eca gca gtc ate aag 
Ala Glu His Ser His Phe Leu Arg Thr Trp Leu Pro Ala val He Lys 
620 625 630 

ate eag get eat tgg egg ggt tat agg cag egg aag att tac ctg gag 
lie Gin Ala His Trp Arg Gly Tyr Arg Gin Arg Lys He Tyr Leu Glu 
635 640 645 

tgg ttg cag tat ttt aaa gea aac ctg gat gee ata ate aag ate cag 
Trp Leu Gin Tyr Phe Lys Ala Asn Leu Asp Ala He He Lys He Gin 
650 655 660 

gcc tgg gee egg atg tgg gca get egg agg caa tac ctg agg egt ctg 
Ala Trp Ala Arg Met Trp Ala Ala Arg Arg Gin Tyr Leu Arg Arg Leu 
665 670 675 680 

cac tac ttc eag aag aat gtt aac tee att gtg aag ate cag gca ttt 
His Tyr Phe Gin Lys Asn Val Asn Ser He Val Lys He Gin Ala Phe 
685 690 695 

ttc cga gcc agg aaa gcc caa gat gac tac agg ata tta gtg cat gca 
Phe Arg Ala Arg Lys Ala Gin Asp Asp Tyr Arg He Leu Val His Ala 
700 "JOS ■'10 

cec cac cct cct etc agt gtg gta cgc aga ttt gee cat etc ttg aat 
Pro His pro Pro Leu Ser Val Val Arg Arg Phe Ala His Leu Leu Asn 
715 720 725 

caa age eag caa gac ttc ttg get gag gca gag ctg ctg aag etc cag 
Gin Ser Gin Gin Asp Phe Leu Ala Glu Ala Glu Leu Leu Lys Leu Gin 
730 735 740 



2189 



2237 



2285 



2333 



2381 



2429 



2477 



2525 



2573 



2621 
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gaa gag gta gtt agg aag ate cga tec aat cag cag ctg gag cag gac 2669 
Glu Glu Val VaX Arg Lys lie Arg Ser Asn Gin Gin Leu Glu Gin Asp 
745 750 755 760 

etc aac ate atg gac ate aag att ggc ctg ctg gtg aag aac egg ate 2717 
Leu Asn He Met Asp He Lys He Gly Leu Leu Val Lys Asn Arg He 
765 770 775 

act ctg cag gaa gtg gtc tec cae tgc aag aag ctg acc aag agg aat 2765 
Thr Leu Gin Glu Val Val Ser His Cys Lys Lys Leu Thr Lys Arg Asn 
780 785 790 

aag gaa cag ctg tea gat atg atg gtt ctg gac aag cag aag ggt tta 2813 
Lys Glu Gin Leu Ser Asp Met Met Val Leu Asp Lys Gin Lys Gly Leu 
795 800 805 

aag teg ctg age aaa gag aaa egg cag aaa eta gaa gea tac caa cae 2861 
Lys Ser Leu Ser Lys Glu Lys Arg Gin Lys Leu Glu Ala Tyr Gin His 
810 815 820 

etc ttc tac ctg etc cag act cag cce ate tac ctg gee aag ctg ate 2909 
Leu Phe Tyr Leu Leu Gin Thr Gin Pro He Tyr Leu Ala Lys Leu He 
825 830 835 840 

ttt cag atg cea cag aac aaa ace aee aag ttc atg gag gea gtg att 2957 
Phe Gin Met Pro Gin Asn Lys Thr Thr Lys Phe Met Glu Ala Val He 
845 850 855 

ttc age ctg tac aac tat gee tec age cge cga gag gee tat etc ctg 3005 
Phe Ser Leu Tyr Asn Tyr Ala Ser Ser Arg Arg Glu Ala Tyr Leu Leu 
860 865 870 

etc cag ctg ttc aag aca gea etc cag gag gaa ate aag tea aag gtg 3053 
Leu Gin Leu Phe Lys Thr Ala Leu Gin Glu Glu He Lys Ser Lys Val 
875 880 885 

gag cag cce cag gac gtg gtg aca ggc aac eca aca gtg gtg agg ctg 3101 
Glu Gin Pro Gin Asp Val Val Thr Gly Asn Pro Thr Val Val Arg Leu 
890 895 900 
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gtg gtg aga ttc tac cgt aat ggg egg gga cag agt gcc ctg cag gag 
Val val Arg Phe Tyr Arg Asn Gly Arg Gly Gin Ser Ala Leu Gin Glu 
905 910 915 920 



3149 



att ctg ggc aag gtt ate cag gat gtg eta gaa gac aaa gtg etc age 
lie Leu Gly Lys Val He Gin Asp Val Leu Glu Asp Lys Val Leu Ser 
925 930 935 



3197 



gtc cac aca gac cct gtc cac etc tat aag aac tgg ate aac eag act 
Val His Thr Asp Pro Val His Leu Tyr Lys Asn Trp He Asn Gin Thr 
940 945 950 



3245 



gag gcc cag aca ggg cag cgc age eat etc cea tat gat gtc acc ccg 
Glu Ala Gin Thr Gly Gin Arg Ser His Leu Pro Tyr Asp Val Thr Pro 
955 960 965 



3293 



gag cag gcc ttg age cac ccc gag gtc cag aga cga ctg gac ate gcc 
Glu Gin Ala Leu Ser His Pro Glu Val Gin Arg Arg Leu Asp He Ala 
970 975 980 



3341 



eta cgc aac etc etc gcc atg act gat aag ttc ctt tta gcc ate acc 
Leu Arg Asn Leu Leu Ala Met Thr Asp Lys Phe Leu Leu Ala He Thr 
985 990 995 1000 



3389 



tea tct gtg gac caa att ccg tat ggg atg cga tat gtg gcc aaa gtc 
Ser Ser Val Asp Gin He Pro Tyr Gly Met Arg Tyr Val Ala Lys Val 
1005 1010 1015 



3437 



ctg aag gea act ctg gea gag aaa ttc cct gac gcc aca gac age gag 
Leu Lys Ala Thr Leu Ala Glu Lys Phe Pro Asp Ala Thr Asp Ser Glu 
1020 1025 1030 



3485 



gtc tat aag gtg gtc ggg aac etc ctg tac tac cgc ttc ctg aac cea 
Val Tyr Lys Val Val Gly Asn Leu Leu Tyr Tyr Arg Phe Leu Asn Pro 
1035 1040 1045 



3533 



get gtg gtg get cct gac gcc ttc gac att gtg gcc atg gea get ggt 
Ala val Val Ala Pro Asp Ala Phe Asp He Val Ala Met Ala Ala Gly 
1050 1055 1060 



3581 
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gga gcc ctg get gcc ccc cag cgc cat gcc ctg ggg get gtg get cag 
Gly Ala Leu Ala Ala Pro Gin Arg His Ala Leu Gly Ala Val Ala Gin 
1065 1070 1075 1080 



3629 



cte eta cag cac get gcg get ggc aag gcc tte tct ggg cag age cag 
Leu Leu Gin His Ala Ala Ala Gly Lys Ala Phe Ser Gly Gin Ser Gin 
1085 1090 1095 



3677 



cac eta egg gtc ctg aat gac tat ctg gag gaa aca cac cte aag ttc 
His Leu Arg Val Leu Asn Asp Tyr Leu Glu Glu Thr His Leu Lys Phe' 
1100 1105 1110 



3725 



agg aag ttc ate cat aga gcc tgc cag gtg cca gag cca gag gag cgt 
Arg Lys Phe He His Arg Ala Cys Gin Val Pro Glu Pro Glu Glu Arg 
1115 1120 1125 



3773 



ttt gca gtg gac gag tac tea gac atg gtg get gtg gcc aaa ccc atg 
Phe Ala Val Asp Glu Tyr Ser Asp Met Val Ala Val Ala Lys Pro Met 
1130 1135 1140 



3821 



gtg tac ate acc gtg ggg gag ctg gtc aac acg cac agg ctg ttg ctg 
Val Tyr He Thr Val Gly Glu Leu Val Asn Thr His Arg Leu Leu Leu 
1145 1150 1155 1160 



3869 



gag cac cag gac tgc att gcc cct gat cac caa gac ccc ctg cat gag 
Glu His Gin Asp Cys He Ala Pro Asp His Gin Asp Pro Leu His Glu 
1165 1170 1175 



3917 



etc ctg gag gat ett ggg gag ctg ccc acc ate cct gac ctt att ggt 
Leu Leu Glu Asp Leu Gly Glu Leu Pro Thr He Pro Asp Leu He Gly 
1180 1185 1190 



3965 



gag age ate get gca gat ggg cac aca gac ctg age aag eta gaa gtg 
Glu Ser He Ala Ala Asp Gly His Thr Asp Leu Ser Lys Leu Glu Val 
1195 1200 1205 



4013 



tec ctg acg ctg ace aac aag ttt gaa gga eta gag gca gat get gat 
Ser Leu Thr Leu Thr Asn Lys Phe Glu Gly Leu Glu Ala Asp Ala Asp 
1210 1215 1220 



4061 
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gac tec aac acc cgt age ctg ctt ctg age acc aag cag ctg ttg gcc 
Asp Ser Asn Thr Arg Ser Leu Leu Leu Ser Thr Lys Gin Leu Leu Ala 
1225 1230 1235 1240 



4109 



gat ate ata eag tte cat eet ggg gac aee etc aag gag ate ctg tee 
Asp lie lie Gin Phe His Pro Gly Asp Thr Leu Lys Glu He Leu Ser 
1245 1250 1255 



4157 



etc teg get tec aga gag caa gaa gea gee eac aag cag ctg atg age 
Leu Ser Ala Ser Arg Glu Gin Glu Ala Ala His Lys Gin Leu Met Ser 
1260 1265 1270 



4205 



cga egc eag gcc tgt aca gee eag aca eeg gag eca ctg cga ega eac 
Arg Arg Gin Ala Cys Thr Ala Gin Thr Pro Glu Pro Leu Arg Arg His 
1275 1280 1285 



4253 



cgc tea ctg aca get eac tec etc ctg eca ctg gea gag aag cag egg 
Arg Ser Leu Thr Ala His Ser Leu Leu Pro Leu Ala Glu Lys Gin Arg 
1290 1295 1300 



4301 



cgc gtc ctg egg aac ctg cgc cga ctt gaa gcc ctg ggg ttg gtc age 
Arg Val Leu Arg Asn Leu Arg Arg Leu Glu Ala Leu Gly Leu Val Ser 
1305 1310 1315 1320 



4349 



gcc aga aat ggc tac cag ggg eta gtg gac gag ctg gcc aag gac ate 
Ala Arg Asn Gly Tyr Gin Gly Leu Val Asp Glu Leu Ala Lys Asp He 
1325 1330 1335 



4397 



cgc aac cag cac aga cac agg eac agg egg aag gea gag ctg gtg aag 
Arg Asn Gin His Arg His Arg His Arg Arg Lys Ala Glu Leu Val Lys 
1340 1345 1350 



4445 



ctg eag gcc aca tta cag ggc ctg age act aag aee acc ttc tat gag 
Leu Gin Ala Thr Leu Gin Gly Leu Ser Thr Lys Thr Thr Phe Tyr Glu 
1355 1360 1365 



4493 



gag cag ggt gac tac tac age cag tac ate egg gee tge ctg gac cac 
Glu Gin Gly Asp Tyr Tyr Ser Gin Tyr He Arg Ala Cys Leu Asp His 
1370 1375 1380 



4541 
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ctg gcc ccc gac tec aag agt tct ggg aag ggg aag aag cag cct tct 
Leu Ala Pro Asp Ser Lys Ser Ser Gly Lys Gly Lys Lys Gin Pro Ser 
1385 1390 1395 1400 



4589 



ctt cat tac act get get cag etc ctg gaa aag ggt gtc ttg gtg gaa 
Leu His Tyr Thr Ala Ala Gin Leu Leu Glu Lys Gly Val Leu Val Glu 
1405 1410 1415 



4637 



att gaa gat ctt ccc gee tct cac ttc aga aac gtc ate ttt gac ate 
lie Glu Asp Leu Pro Ala Ser His Phe Arg Asn Val He Phe Asp He 
1420 1425 1430 



4685 



aeg ccg gga gat gag gca gga aag ttt gaa gta aat gcc aag ttc ctg 
Thr Pro Gly Asp Glu Ala Gly Lys Phe Glu Val Asn Ala Lys Phe Leu 
1435 1440 1445 



4733 



ggt gtg gac atg gag cga ttt cag ctt cac tat cag gat etc ctg cag 
Gly Val Asp Met Glu Arg Phe Gin Leu His Tyr Gin Asp Leu Leu Gin 
1450 1455 1460 



4781 



etc cag tat gag ggt gtg get gtc atg aaa etc ttc aac aag gee aaa 
Leu Gin Tyr Glu Gly Val Ala Val Met Lys Leu Phe Asn Lys Ala Lys 
1465 1470 1475 1480 



4829 



gtc aat gtc aac ctt etc ate ttc etc etc aac aag aag ttt ttg egg 
Val Asn Val Asn Leu Leu He Phe Leu Leu Asn Lys Lys Phe Leu Arg 
1485 1490 1495 



4877 



aag 
Lys 



tgacagaggc aaagggtgct aeccaagece ctettacetc tctggatget 



ttetttaaea etaactcace actgtgettc cetgeagaca cecagagetc aggactggge 
aaggcccagg gatteteace cettceccag etgggaggag cttgeetgce tggceacaga 
cagtgtatct tetaattggc taaagtgggc ettgcccaga gtceagetgt gtggctttta 
teatgeatga caaacccctg getttcctgc cagatggatt etcatecctt acagctgact 
cttccaggca atttccatag atctgcagtc ctgcctetge cacagtctct ctgttgtcee 



4930 

4990 
5050 
5110 
5170 
5230 
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cacatctacc caacttcctg tactgttgcc cttctgatgt taataaaagc agctgttact 



ccc 



<210> 2 

<211> 1497 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Pro Arg Val Val Tyr Cys He His Ala Leu Ser Leu Phe Leu Phe 
15 10 15 

Arg Leu Gly Leu Ala Pro Gin He His Asp Leu Tyr Gly Lys Val Lys 
20 25 30 

Phe Thr Ala Glu Glu Leu Ser Asn Met Ala Ser Glu Leu Ala Lys Tyr 
35 40 45 

Gly Leu Gin Leu Pro Ala Phe Ser Lys He Gly Gly He Leu Ala Asn 
50 55 60 

Glu Leu Ser Val Asp Glu Ala Ala Val His Ala Ala Val Leu Ala He 
65 70 75 80 

Asn Glu Ala Val Glu Arg Gly Val Val Glu Asp Thr Leu Ala Ala Leu 
85 90 95 

Gin Asn Pro Ser Ala Leu Leu Glu Asn Leu Arg Glu Pro Leu Ala Ala 
100 105 110 

val Tyr Gin Glu Met Leu Ala Gin Ala Lys Met Glu Lys Ala Ala Asn 
1X5 120 125 

Ala Arg Asn His Asp Asp Arg Glu Ser Gin Asp He Tyr Asp His Tyr 
130 135 140 

Leu Thr Gin Ala Glu He Gin Gly Asn He Asn His Val Asn Val His 



5290 



5293 
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145 150 155 160 

Gly Ala Leu Glu Val Val Asp Asp Ala Leu Glu Arg Gin Ser Pro Glu 
165 170 175 

Ala Leu Leu Lys Ala Leu Gin Asp Pro Ala Leu Ala Leu Arg Gly Val 
180 185 190 

Arg Arg Asp Phe Ala Asp Trp Tyr Leu Glu Gin Leu Asn Ser Asp Arg 
195 200 205 

Glu Gin Lys Ala Gin Glu Leu Gly Leu Val Glu Leu Leu Glu Lys Glu 
210 215 220 

Glu Val Gin Ala Gly Val Ala Ala Ala Asn Thr Lys Gly Asp Gin Glu 
225 230 235 240 

Gin Ala Met Leu His Ala Val Gin Arg He Asn Lys Ala He Arg Arg 
245 250 255 

Gly Val Ala Ala Asp Thr Val Lys Glu Leu Met Cys Pro Glu Ala Gin 
260 265 270 

Leu Pro Pro Val Tyr Pro Val Ala Ser Ser Met Tyr Gin Leu Glu Leu 
275 280 285 

Ala Val Leu Gin Gin Gin Gin Gly Glu Leu Gly Gin Glu Glu Leu Phe 
290 295 300 

Val Ala Val Glu Met Leu Ser Ala Val Val Leu He Asn Arg Ala Leu 
305 310 315 320 

Glu Ala Arg Asp Ala Ser Gly Phe Trp Ser Ser Leu Val Asn Pro Ala 
325 330 335 

Thr Gly Leu Ala Glu Val Glu Gly Glu Asn Ala Gin Arg Tyr Phe Asp 
340 345 350 

Ala Leu Leu Lys Leu Arg Gin Glu Arg Gly Met Gly Glu Asp Phe Leu 
355 360 365 
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Sex Trp Asn Asp Leu Gin Ala Thr Val Ser Gin Val Asn Ala Gin Thr 
370 375 380 

Gin Glu Glu Thr Asp Arg Val Leu Ala Val Ser Leu lie Asn Glu Ala 
385 390 395 400 

Leu Asp Lys Gly Ser Pro Glu Lys Thr Leu Ser Ala Leu Leu Leu Pro 
405 410 415 

Ala Ala Gly Leu Asp Asp Val Ser Leu Pro Val Ala Pro Arg Tyr His 
420 425 430 

Leu Leu Leu Val Ala Ala Lys Arg Gin Lys Ala Gin Val Thr Gly Asp 
435 440 445 

Pro Gly Ala val Leu Trp Leu Glu Glu lie Arg Gin Gly Val Val Arg 
450 455 460 

Ala Asn Gin Asp Thr Asn Thr Ala Gin Arg Met Ala Leu Gly Val Ala 
465 470 475 480 

Ala He Asn Gin Ala He Lys Glu Gly Lys Ala Ala Gin Thr Glu Arg 
485 490 495 

Val Leu Arg Asn Pro Ala Val Ala Leu Arg Gly Val Val Pro Asp Cys 
500 505 510 

Ala Asn Gly Tyr Gin Arg Ala Leu Glu Ser Ala Met Ala Lys Lys Gin 
515 520 525 

Arg Pro Ala Asp Thr Ala Phe Trp Val Gin His Asp Met Lys Asp Gly 
530 535 540 

Thr Ala Tyr Tyr Phe His Leu Gin Thr Phe Gin Gly He Trp Glu Gin 
545 550 555 560 

Pro Pro Gly Cys Pro Leu Asn Thr Ser His Leu Thr Arg Glu Glu lie 
565 570 575 
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Gin Ser Ala Val Thr Lys Val Thr Ala Ala Tyr Asp Arg Gin Gin Leu 
580 585 590 

Trp Lys Ala Asn Val Gly Phe Val He Gin Leu Gin Ala Arg Leu Arg 
595 600 605 

Gly Phe Leu Val Arg Gin Lys .Phe Ala Glu His Ser His Phe Leu Arg 
610 615 620 

Thr Trp Leu Pro Ala Val He Lys He Gin Ala His Trp Arg Gly Tyr 
625 630 635 640 

Arg Gin Arg Lys He Tyr Leu Glu Trp Leu Gin Tyr Phe Lys Ala Asn 
645 650 655 

Leu Asp Ala He He Lys He Gin Ala Trp Ala Arg Met Trp Ala Ala 
660 665 670 

Arg Arg Gin Tyr Leu Arg Arg Leu His Tyr Phe Gin Lys Asn Val Asn 
675 680 685 

Ser He Val Lys He Gin Ala Phe Phe Arg Ala Arg Lys Ala Gin 'Asp 
690 695 700 

Asp Tyr Arg He Leu Val His Ala Pro His Pro Pro Leu Ser Val Val 
705 710 715 720 

Arg Arg Phe Ala His Leu Leu Asn Gin Ser Gin Gin Asp Phe Leu Ala 
725 730 735 

Glu Ala Glu Leu Leu Lys Leu Gin Glu Glu Val Val Arg Lys He Arg 
740 745 750 

Ser Asn Gin Gin Leu Glu Gin Asp Leu Asn He Met Asp He Lys He 
755 760 765 

Gly Leu Leu val Lys Asn Arg He Thr Leu Gin Glu Val Val Ser His 
770 775 780 

Cys Lys Lys Leu Thr Lys Arg Asn Lys Glu Gin Leu Ser Asp Met Met 
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785 790 795 800 

Val Leu Asp Lys Gin Lys Gly Leu Lys Ser Leu Ser Lys Glu Lys Arg 
805 8X0 815 

Gin Lys Leu Glu Ala Tyr Gin His Leu Phe Tyr Leu Leu Gin Thr Gin 
920 825 830 

Pro He Tyr Leu Ala Lys Leu He Phe Gin Met Pro Gin Asn Lys Thr 
835 840 845 

Thr Lys Phe Met Glu Ala Val He Phe Ser Leu Tyr Asn Tyr Ala Ser 
850 855 8eO 

Ser Arg Arg Glu Ala Tyr Leu Leu Leu Gin Leu Phe Lys Thr Ala Leu 
865 870 875 880 

Gin Glu Glu He Lys Ser Lys Val Glu Gin Pro Gin Asp Val Val Thr 
885 890 895 

Gly Asn Pro Thr Val Val Arg Leu Val Val Arg Phe Tyr Arg Asn Gly 
900 905 910 

Arg Gly Gin Ser Ala Leu Gin Glu He Leu Gly Lys Val He Gin Asp 
915 920 925 

Val Leu Glu Asp Lys Val Leu Ser Val His Thr Asp Pro Val His Leu 
930 935 940 

Tyr Lys Asn Trp He Asn Gin Thr Glu Ala Gin Thr Gly Gin Arg Ser 
945 950 955 960 

His Leu Pro Tyr Asp Val Thr Pro Glu Gin Ala Leu Ser His Pro Glu 
965 970 975 

Val Gin Arg Arg Leu Asp He Ala Leu Arg Asn Leu Leu Ala Met Thr 
980 985 990 

Asp Lys Phe Leu Leu Ala He Thr Ser Ser Val Asp Gin He Pro Tyr 
995 1000 1005 
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Gly Met Arg Tyr Val Ala Lys Val Leu Lys Ala Thr Leu Ala Glu Lys 
1010 1015 1020 

Phe Pro Asp Ala Thr Asp Ser Glu Val Tyr Lys Val Val Gly Asn Leu 
1025 1030 1035 1040 

Leu Tyr Tyr Arg Phe Leu Asn Pro Ala val Val Ala Pro Asp Ala Phe 
1045 1050 1055 

Asp lie Val Ala Met Ala Ala Gly Gly Ala Leu Ala Ala Pro Gin Arg 
1060 1065 1070 

His Ala Leu Gly Ala Val Ala Gin Leu Leu Gin His Ala Ala Ala Gly 
1075 1080 1085 

Lys Ala Phe Ser Gly Gin Ser Gin His Leu Arg Val Leu Asn Asp Tyr 
1090 1095 1100 

Leu Glu Glu Thr His Leu Lys Phe Arg Lys Phe lie His Arg Ala Cys 
1105 1110 1115 1120 

Gin Val Pro Glu Pro Glu Glu Arg Phe Ala Val Asp Glu Tyr Ser Asp 
1125 1130 1135 

Met val Ala Val Ala Lys Pro Met Val Tyr lie Thr Val Gly Glu Leu 
1140 1145 1150 

Val Asn Thr His Arg Leu Leu Leu Glu His Gin Asp Cys He Ala Pro 
1155 1160 1165 

Asp His Gin Asp Pro Leu His Glu Leu Leu Glu Asp Leu Gly Glu Leu 
1170 1175 1180 

Pro Thr He Pro Asp Leu He Gly Glu Ser He Ala Ala Asp Gly His 
1185 1190 1195 1200 

Thr Asp Leu Ser Lys Leu Glu Val Ser Leu Thr Leu Thr Asn Lys Phe 
1205 1210 1215 
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Glu Gly Leu Glu Ala Asp Ala Asp Asp Ser Asn Thr Arg Ser Leu Leu 
1220 1225 1230 

Leu Ser Thr Lys Gin Leu Leu Ala Asp He He Gin Phe His Pro Gly 
1235 1240 1245 

Asp Thr Leu Lys Glu He Leu Ser Leu Ser Ala Ser Arg Glu Gin Glu 
1250 1255 1260 

Ala Ala His Lys Gin Leu Met Ser Arg Arg Gin Ala Cys Thr Ala Gin 
1265 1270 1275 1280 

Thr Pro Glu Pro Leu Arg Arg His Arg Ser Leu Thr Ala His Ser Leu 
1285 1290 1295 

. Leu Pro Leu Ala Glu Lys Gin Arg Arg Val Leu Arg Asn Leu Arg Arg 
1300 1305 1310 

Leu Glu Ala Leu Gly Leu Val Ser Ala Arg Asn Gly Tyr Gin Gly Leu 
1315 1320 1325 

Val Asp Glu Leu Ala Lys Asp He Arg Asn Gin His Arg His Arg His 
1330 1335 1340 

Arg Arg Lys Ala Glu Leu Val Lys Leu Gin Ala Thr Leu Gin Gly Leu 
1345 1350 1355 1360 

ser Thr Lys Thr Thr Phe Tyr Glu Glu Gin Gly Asp Tyr Tyr Ser Gin 
1365 1370 1375 

Tyr He Arg Ala Cys Leu Asp His Leu Ala Pro Asp Ser Lys Ser Ser 
1380 1385 1390 

Gly Lys Gly Lys Lys Gin Pro Ser Leu His Tyr Thr Ala Ala Gin Leu 
1395 1400 1405 

Leu Glu Lys Gly Val Leu val Glu He Glu Asp Leu Pro Ala Ser His 
1410 1415 1420 

Phe Arg Asn Val He Phe Asp He Thr Pro Gly Asp Glu Ala Gly Lys 
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1425 



1430 



1435 



1440 



Phe Glu Val Asn Ala Lys Phe Leu Gly Val Asp Met Glu Arg Phe Gin 
1445 1450 1455 

Leu His Tyr Gin Asp Leu Leu Gin Leu Gin Tyr Glu Gly Val Ala val 
1460 1465 1470 

Met Lys Leu Phe Asn Lys Ala Lys Val Asn Val Asn Leu Leu lie Phe 
1475 1480 1485 

Leu Leu Asn Lys Lys Phe Leu Arg Lys 
1490 1495 



<210> 
<211> 
<212> 
<213> 



3 

5317 
DNA 

Homo sapiens 



<220> 
<221> 
<222> 
<223> 



CDS 

(12) . . (4904) 
LBFL109 Clone B 



<400> 3 
gaaggaggaa c 



atg gag agg aga gca gcg ggc oca ggc tgg gca 
Met Glu Arg Arg Ala Ala Gly Pro Gly Trp Ala 
15 10 



44 



gcc tat gad cgc etc aca get gag gag atg gat gag cag agg egg cag 
Ala Tyr Glu Arg Leu Thr Ala Glu Glu Met Asp Glu Gin Arg Arg Gin 
15 20 25 



92 



aat gtt gcc tat cag tac ctg tgc egg ctg gag gag gcc aag cgc tgg 
Asn Val Ala Tyr Gin Tyr Leu Cys Arg Leu Glu Glu Ala Lys Arg Trp 
30 35 40 



140 



atg gag gcc tgc ctg aag gag gag ctt cct tec ccg gtg gag ctg gag 



188 



-19- 



gcc ttc age aag ate ggg ggc ate ttg gcc aat gag etc teg gtg gat 
Ala Phe Ser Lys lie Gly Gly He Leu Ala Asn Glu Leu Ser Val Asp 
190 195 200 

gag get gca gtc cat gea get gtt ctt gee ate aat gaa gea gtg gag 



236 



284 



332 
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Met Glu Ala Cys Leu Lys Glu Glu Leu Pro Ser Pro Val Glu Leu Glu 
45 50 55 

gag age ctt egg aat gga gtg ctg ctg gcc aag etg ggc cac tgt ttt 
Glu Ser Leu Arg Asn Gly Val Leu Leu Ala Lys Leu Gly His Cys Phe 
60 65 70 75 

gca ccc tec gtg gtt ccg ttg aag aag ate tac gat gtg gag eag ctg 
Ala Pro Ser Val Val Pro Leu Lys Lys lie Tyr Asp Val Glu Gin Leu 
80 85 90 

egg tac eag gca act ggc tta cat ttc cgt cac aca gae aac ate aac 
Arg Tyr Gin Ala Thr Gly Leu His Phe Arg His Thr Asp Asn He Asn 
95 100 105 

ttt tgg eta tet gca ata gcc cac ate ggt ctg cct teg ace ttc ttc 
Phe Trp Leu Ser Ala He Ala His He Gly Leu Pro Ser Thr Phe Phe 
110 115 120 

cca gag acc acg gac ate tat gac aaa aag aac atg ccc egg gta gtc 
Pro Glu Thr Thr Asp He Tyr Asp Lys Lys Asn Met Pro Arg Val Val 
125 130 135 

tac tgc ate cat get etc agt etc ttc etc ttc egg ctg gga ttg gcc 
Tyr Cys He His Ala Leu Ser Leu Phe Leu Phe Arg Leu Gly Leu Ala 
140 145 150 155 

cct eag ata cat gat eta tac ggg aaa gtg aaa ttc aca get gag gaa 
Pro Gin He His Asp Leu Tyr Gly Lys Val Lys Phe Thr Ala Glu Glu 
160 165 170 

etc age aac atg gcg tec gaa ctg gcc aaa tat ggc etc eag etg cct 572 
Leu Ser Asn Met Ala Ser Glu Leu Ala Lys Tyr Gly Leu Gin Leu Pro 
175 180 185 



380 



428 



476 



524 



620 



668 
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Glu Ala Ala Val His Ala Ala Val Leu Ala lie Asn Glu Ala Val Glu 
205 210 215 



cga ggg gtg gtg gag gac acc ctg get gcc ttg cag aat ccc agt get 
Arg Gly Val Val Glu Asp Thr Leu Ala Ala Leu Gin Asn Pro Ser Ala 
220 225 230 235 



716 



ctt ctg gag aat etc cga gag cct ctg gca gcc gtc tac eag gag atg 
Leu Leu Glu Asn Leu Arg Glu Pro Leu Ala Ala Val Tyr Gin Glu Met 
240 245 250 



764 



ctg gee cag gee aag atg gag aag gca gcc aat gcc agg aac cat gat 
Leu Ala Gin Ala Lys Met Glu Lys Ala Ala Asn Ala Arg Asn His Asp 
255 260 265 



812 



gac aga gaa age cag gac ate tat gac cac tac eta act cag get gaa 
Asp Arg Glu Ser Gin Asp lie Tyr Asp His Tyr Leu Thr Gin Ala Glu 
270 275 280 



860 



ate cag gge aat ate aac cat gtc aac gtc cat ggg get eta gaa gtt 
lie Gin Gly Asn lie Asn His Val Asn Val His Gly Ala Leu Glu Val 
285 290 295 



908 



gtt gat gat gee ctg gaa aga cag age cct gaa gcc ttg etc aag gee 
Val Asp Asp Ala Leu Glu Arg Gin Ser Pro Glu Ala Leu Leu Lys Ala 
300 305 310 315 



956 



ctt caa gac cct gcc ctg gcc ctg cga ggg gtg agg aga gac ttt get 
Leu Gin Asp Pro Ala Leu Ala Leu Arg Gly Val Arg Arg Asp Phe Ala 
320 325 330 



1004 



gac tgg tac ctg gag cag ctg aac tea gac aga gag cag aag gca cag 
Asp Trp Tyr Leu Glu Gin Leu Asn Ser Asp Arg Glu Gin Lys Ala Gin 
335 340 345 



1052 



gag ctg gge ctg gtg gag ctt ctg gaa aag gag gaa gtc cag get ggt 
Glu Leu Gly Leu Val Glu Leu Leu Glu Lys Glu Glu Val Gin Ala Gly 
350 355 360 

gtg get gca gee aac aca aag ggt gat cag gaa caa gcc atg etc cac 



1100 



1148 
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Val Ala Ala Ala Asn Thr Lys Gly Asp Gin Glu Gin Ala Met Leu His 
365 370 375 



get gtg cag egg ate aac aaa gee ate egg agg gga gtg gcg get gac 
Ala Val Gin Arg lie Asn Lys Ala lie Arg Arg Gly Val Ala Ala Asp 
380 385 390 395 



1196 



act gtg aag gag ctg atg tgc cct gag gee eag etg eet eea gtg tae 
Thr val Lys Glu Leu Met Cys Pro Glu Ala Gin Leu Pro Pro Val Tyr 
400 405 410 



1244 



cct gtt gca teg tct atg tac cag ctg gag ctg gca gtg etc cag cag 
Pro Val Ala Ser Ser Met Tyr Gin Leu Glu Leu Ala Val Leu Gin Gin 
415 420 425 



1292 



cag cag ggg gag ctt ggc cag gag gag etc ttc gtg get gtg gag atg 
Gin Gin Gly Glu Leu Gly Gin Glu Glu Leu Phe Val Ala Val Glu Met 
430 435 440 



1340 



etc tea get gtg gtc ctg att aac egg gee ctg gag gcc egg gat gee 
Leu Ser Ala Val Val Leu lie Asn Arg Ala Leu Glu Ala Arg Asp Ala 
445 450 455 

agt ggc ttc tgg age age etg gtg aac eet gee aea gge etg get gag 
Ser Gly Phe Trp Ser Ser Leu Val Asn Pro Ala Thr Gly Leu Ala Glu 
460 465 470 475 



1388 



1436 



gtg gaa gga gaa aat gcc cag egt tae ttc gat gee etg etg aaa ttg 
val Glu Gly Glu Asn Ala Gin Arg Tyr Phe Asp Ala Leu Leu Lys Leu 
480 485 490 



1484 



ega eag gag egt ggg atg ggt gag gac ttc ctg age tgg aat gae ctg 
Arg Gin Glu Arg Gly Met Gly Glu Asp Phe Leu Ser Trp Asn Asp Leu 
495 500 505 



1532 



cag gcc acc gtg age cag gtc aat gca eag ace cag gaa gag act gae 
Gin Ala Thr Val Ser Gin Val Asn Ala Gin Thr Gin Glu Glu Thr Asp 
510 515 520 



1580 



egg gtc ctt gca gtc age etc ate aat gag get ctg gac aaa ggc age 



1628 
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Arg Val Leu Ala Val Ser Leu lie Asn Glu Ala Leu Asp Lys Gly Ser 
525 530 535 



PCT/iCR2003/002161 



cct g<jo aag act ctg tct gcc eta ctg ctt cct gca get ggc eta gat 
Pro Glu Lys Thr Leu Ser Ala Leu Leu Leu Pro Ala Ala Gly Leu Asp 
540 545 550 555 



1676 



gat gtc age etc cct gtc gcc cct egg tae eat etc etc ctt gtg gca 
Asp Val Ser Leu Pro Val Ala Pro Arg Tyr His Leu Leu Leu Val Ala 
560 565 570 



1724 



gcc aaa agg cag aag gcc cag gtg aca ggg gat cct gga get gtg ctg 
Ala Lys Arg Gin Lys Ala Gin Val Thr Gly Asp Pro Gly Ala Val Leu 
575 580 585 



1772 



tgg ctt gag gag ate cgc cag gga gtg gtc aga gcc aac cag gac act 
Trp Leu Glu Glu lie Arg Gin Gly Val Val Arg Ala Asn Gin Asp Thr 
590 595 600 



1820 



aat aca get cag aga atg get ctt ggt gtg get gcc ate aat caa gcc 
Asn Thr Ala Gin Arg Met Ala Leu Gly Val Ala Ala He Asn Gin Ala 
605 610 615 



1868 



ate aag gag ggc aag gca gee cag act gag egg gtg ttg agg aac ccc 
He Lys Glu Gly Lys Ala Ala Gin Thr Glu Arg Val Leu Arg Asn Pro 
620 625 630 635 



1916 



gca gtg gcc ctt cga ggg gta gtt ccc gac tgt gcc aac ggc tae cag 
Ala Val Ala Leu Arg Gly Val Val Pro Asp Cys Ala Asn Gly Tyr Gin 
640 645 650 



1964 



cga gcc ctg gaa agt gcc atg gca aag aaa cag egt cca gca gac aca 
Arg Ala Leu Glu Ser Ala Met Ala Lys Lys Gin Arg Pro Ala Asp Thr 
655 660 665 



2012 



get tte tgg gtt caa cat gac atg aag gat ggc act gcc tae tae tte 
Ala Phe Trp Val Gin His Asp Met Lys Asp Gly Thr Ala Tyr Tyr Phe 
670 675 680 



2060 



cat ctg cag acc tte cag ggg ate tgg gag caa cct cct ggc tgc ccc 



2108 
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His I#eu Gin Thr Phe Gin Gly He Trp Glu Gin Pro Pro Gly Cys Pro 
685 690 695 
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etc aac acc tct cac ctg acc egg gag gag ate cag tea get gtc ace 
Leu Asn Thr Ser His Leu Thr Arg Glu Glu He Gin Ser Ala Val Thr 
700 705 710 715 



2156 



aag gtc act get gee tat gae egc caa cag etc tgg aaa gee aac gtc 
Lys Val Thr Ala Ala Tyr Asp Arg Gin Gin Leu Trp Lys Ala Asn Val 
720 725 730 



2204 



ggc ttt gtt ate cag etc cag gcc cgc etc cgt ggc ttc eta gtt egg 
Gly Phe Val He Gin Leu Gin Ala Arg Leu Arg Gly Phe Leu Val Arg 
735 740 745 



2252 



cag aag ttt get gag cat tec cac ttt ctg agg ace tgg etc eca gca 
Gin Lys Phe Ala Glu His Ser His Phe Leu Arg Thr Trp Leu Pro Ala 
750 755 760 



2300 



gtc ate aag ate cag get eat tgg egg ggt tat agg cag egg aag att 
Val He Lys He Gin Ala His Trp Arg Gly Tyr Arg Gin Arg Lys He 
765 770 775 



2348 



tac ctg gag tgg ttg cag tat ttt aaa gca aac ctg gat gcc ata ate 
Tyr Leu Glu Trp Leu Gin Tyr Phe Lys Ala Asn Leu Asp Ala He He 
780 785 790 795 



2396 



aag ate cag gcc tgg gcc egg atg tgg gca get egg agg caa tac ctg 
Lys He Gin Ala Trp Ala Arg Met Trp Ala Ala Arg Arg Gin Tyr Leu 
800 805 810 



2444 



agg cgt ctg cac tac ttc cag aag aat gtt aac tec att gtg aag ate 
Arg Arg Leu His Tyr Phe Gin Lys Asn Val Asn Ser He Val Lys He 
815 820 825 



2492 



cag gca ttt ttc cga gcc agg aaa gcc caa gat gac tac agg ata tta 
Gin Ala Phe Phe Arg Ala Arg Lys Ala Gin Asp Asp Tyr Arg He Leu 
830 835 840 



2540 



gtg eat gea eec cac cct cct etc agt gtg gta cgc aga ttt gcc cat 



2588 



-24- 



wo 2004/035789 



Sequence Listing 



PCT/KR2003/002161 



Val His Ala Pro His Pro Pro Leu Ser Val Val Arg Arg Phe Ala His 
845 850 855 



etc ttg aat caa age cag caa gac ttc ttg get gag gca gag etg ctg 
Leu Leu Asn Gin Ser Gin Gin Asp Phe Leu Ala Glu Ala Glu Leu Leu 
860 865 870 875 



2636 



aag etc cag gaa gag gta gtt agg aag ate cga tec aat cag eag ctg 
Lys Leu Gin Glu Glu Val Val Arg Lys lie Arg Ser Asn Gin Gin Leu 
880 885 890 



2684 



gag cag gac etc aac ate atg gac ate aag att ggc ctg ctg gtg aag 
Glu Gin Asp Leu Asn lie Met Asp He Lys He Gly Leu Leu Val Lys 
895 900 905 



2732 



aac egg ate act ctg cag gaa gtg gtc tec cac tgc aag aag ctg acc 
Asn Arg He Thr Leu Gin Glu Val Val Ser His Cys Lys Lys Leu Thr 
910 915 920 



2780 



aag agg aat aag gaa cag ctg tea gat atg atg gtt ctg gac aag cag 
Lys Arg Asn Lys Glu Gin Leu Ser Asp Met Met Val Leu Asp Lys Gin 
925 930 935 



2828 



aag ggt tta aag teg ctg age aaa gag aaa egg eag aaa eta gaa gca 
Lys Gly Leu Lys Ser Leu Ser Lys Glu Lys Arg Gin Lys Leu Glu Ala 
940 945 950 955 



2876 



tac caa cac etc ttc tac etg etc cag act cag cec ate tac ctg gee 
Tyr Gin His Leu Phe Tyr Leu Leu Gin Thr Gin Pro He Tyr Leu Ala 
960 965 970 



2924 



aag ctg ate ttt cag atg cea cag aac aaa acc ace aag ttc atg gag 
Lys Leu He Phe Gin Met Pro Gin Asn Lys Thr Thr Lys Phe Met Glu 
975 980 985 



2972 



gca gtg att ttc age etg tac aac tat gee tec age cgc cga gag gee 
Ala Val He Phe Ser Leu Tyr Asn Tyr Ala Ser Ser Arg Arg Glu Ala 
990 995 1000 



3020 



tat etc ctg etc cag ctg ttc aag aca gca etc cag gag gaa ate aag 



3068 



-25- 



, — PCT/KR2003/002161 
WO 2004/035789 



Sequence Listing 



Tyr Leu Leu ieu Gin Leu Phe Lys Thr Ala Leu Gin Glu Glu He Lys 
1005 1010 lOlS 

tea aag gtg gag cag ccc cag gac gtg gtg aca ggc aac cca aca gtg 
scr Lys Val Glu Gin Pro Gin Asp Val Val Thr Gly Asn Pro Thr Val 
1020 1025 1030 1035 

gtg agg ctg gtg gtg aga ttc tac cgt aat ggg egg gga cag agt gcc 
val Arg Leu Val Val Arg Phe Tyr Arg Asn Gly Arg Gly Gin Ser Ala 
1040 1045 1050 

ctg cag gag att ctg ggc aag gtt ate cag gat gtg eta gaa gac aaa 
Leu Gin Glu lie Leu Gly Lys Val He Gin Asp Val Leu Glu Asp Lys 
1055 1060 1065 

gtg etc age gte eac aca gac cct gtc cac etc tat aag aac tgg ate 
val Leu ser val His Thr Asp Pro Val His Leu Tyr Lys Asn Trp He 
1070 1075 1090 

aac cag act gag gee cag aca ggg cag egc age cat etc cca tat gat 
Asn Gin Thr Glu Ala Gin Thr Gly Gin Arg Ser His Leu Pro Tyr Asp 
1085 1090 lOM 

gtc ace ccg gag cag gee ttg age eac ccc gag gtc cag aga cga ctg 
val Thr pro Glu Gin Ala Leu Ser His Pro Glu Val Gin Arg Arg Leu 
1100 1105 111° 

gae ate gee eta cge aac etc etc gee atg act gat aag ttc ctt tta 
ASP lie Ala Leu Arg Asn Leu Leu Ala Met Thr Asp Lys Phe Leu Leu 
1120 1125 ^130 

gee ate ace tea tct gtg gac eaa att ecg tat ggg atg ega tat gtg 
Ala lie Thr Ser Ser Val Asp Gin He Pro Tyr Gly Met Arg Tyr Val 
1135 1140 11" 

gcc aaa gtc ctg aag gea act ctg gea gag aaa ttc cct gae gcc aca 
Ala Lys val Leu Lys Ala Thr Leu Ala Glu Lys Phe Pro Asp Ala Thr 
1150 1155 

gac age gag gtc tat aag gtg gtc ggg aac etc ctg tae tac cge ttc 



3116 



3164 



3212 



3260 



3308 



3356 



3404 



3452 



3500 



3548 
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Asp Ser Glu Val Tyr Lys Val Val Gly Asn Leu Leu Tyr Tyr Arg Phe 
1165 1170 1175 

ctg aac cca get gtg gtg get eet gac gee tte gae att gtg gee atg 3596 
Leu Asn Pro Ala Val Val Ala Pro Asp Ala Phe Asp He Val Ala Met 
1180 1185 1190 1195 

gca get ggt gga gcc etg get gee cec eag cgc cat gee ctg ggg get 3644 
Ala Ala Gly Gly Ala Leu Ala Ala Pro Gin Arg His Ala Leu Gly Ala 
1200 1205 1210 

gtg get cag etc eta eag eac get geg get gge aag gee tte tet ggg 3692 
Val Ala Gin Leu Leu Gin His Ala Ala Ala Gly Lys Ala Phe Ser Gly 
1215 1220 1225 

eag age eag eac eta egg gte ctg aat gac tat ctg gag gaa aca cac 3740 
Gin Ser Gin His Leu Arg Val Leu Asn Asp Tyr Leu Glu Glu Thr His 
1230 1235 1240 

etc aag tte agg aag tte ate cat aga gcc tgc cag gtg cca gag cca 3788 
Leu Lys Phe Arg Lys Phe He His Arg Ala Cys Gin Val Pro Glu Pro 
1245 1250 1255 

gag gag cgt ttt gca gtg gac gag tac tea gac atg gtg get gtg gcc 3836 
Glu Glu Arg Phe Ala Val Asp Glu Tyr Ser Asp Met Val Ala Val Ala 
1260 1265 1270 1275 

aaa cce atg gtg tac ate ace gtg ggg gag ctg gte aac acg cae agg 3884 
Lys Pro Met Val Tyr lie Thr Val Gly Glu Leu Val Asn Thr His Arg 
1280 1285 1290 

etg ttg etg gag cae eag gae tgc att gee eet gat eac caa gae cce 3932 
Leu Leu Leu Glu His Gin Asp Cys He Ala Pro Asp His Gin Asp Pro 
1295 1300 1305 



etg eat gag etc ctg gag gat ett ggg gag 
Leu His Glu Leu Leu Glu Asp Leu Gly Glu 
1310 1315 

ctt att ggt gag age ate get gca gat ggg 



etg cce ace ate eet gae 3980 
Leu Pro Thr He Pro Asp 
1320 

cac aca gac ctg age aag 4028 
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Leu He Gly Glu Ser He Ala Ala Asp Gly His Thr Asp Leu Ser Lys 
1325 1330 1335 

eta gaa gtg tec etg acg ctg acc aac aag ttt gaa gga eta gag gea 4076 
Leu Glu Val Ser Leu Thr Leu Thr Asn Lys Phe Glu Gly Leu Glu Ala 
1340 1345 1350 1355 

gat get gat gac tec aac acc cgt age ctg ctt ctg age acc aag cag 4124 
Asp Ala Asp Asp Ser Asn Thr Arg Ser Leu Leu Leu Ser Thr Lys Gin 
1360 1365 1370 

ctg ttg gcc gat ate ata cag ttc cat cct ggg gac acc etc aag gag 4172 
Leu Leu Ala Asp He He Gin Phe His Pro Gly Asp Thr Leu Lys Glu 
1375 1380 1385 

ate ctg tec etc teg get tec aga gag caa gaa gea gcc eac aag cag 4220 
He Leu Ser Leu Ser Ala Ser Arg Glu Gin Glu Ala Ala His Lys Gin 
1390 1395 1400 

ctg atg age cga cgc cag gcc tgt aca gcc cag aca ccg gag cca etg 4268 
Leu Met Ser Arg Arg Gin Ala Cys Thr Ala Gin Thr Pro Glu Pro Leu 
1405 1410 1415 

cga cga eac cgc tea ctg aca get cac tec etc etg cca ctg gea gag 4316 
Arg Arg His Arg Ser Leu Thr Ala His Ser Leu Leu Pro Leu Ala Glu 
1420 1425 1430 1435 

aag cag egg cgc gtc etg egg aac etg cgc cga ctt gaa gcc ctg ggg 4364 
Lys Gin Arg Arg Val Leu Arg Asn Leu Arg Arg Leu Glu Ala Leu Gly 
1440 1445 1450 

ttg gtc age gee aga aat ggc tac cag ggg eta gtg gac gag ctg gcc 4412 
Leu Val Ser Ala Arg Asn Gly Tyr Gin Gly Leu Val Asp Glu Leu Ala 
1455 1460 1465 

aag gac ate cgc aac cag cac aga eac agg eac agg egg aag gea gag 44 60 

Lys Asp He Arg Asn Gin His Arg His Arg His Arg Arg Lys Ala Glu 
1470 1475 1480 

ctg gtg aag ctg cag gcc aca tta cag ggc ctg age act aag acc acc 4508 
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Leu Val Lys Leu Gin Ala Thr Leu Gin Gly Leu Ser Thr Lys Thr Thr 
1485 X490 1495 



ttc tat gag gag cag ggt gac tac tac age cag tac ate egg gee tgc 
Phe T>r Glu Glu Gin Gly Asp Tyr Tyr Ser Gin Tyr He Arg Ala Cys 
1500 1505 1510 1515 



4556 



ctg gac cac ctg gee ecc gac tec aag agt tet ggg aag ggg aag aag 
Leu Asp His Leu Ala Pro Asp Ser Lys Ser Ser Gly Lys Gly Lys Lys 
1520 1525 1530 



4604 



cag cct tet ctt cat tac act get get cag etc ctg gaa aag ggt gte 
Gin Pro Ser Leu His Tyr Thr Ala Ala Gin Leu Leu Glu Lys Gly val 
1535 1540 1545 



4652 



ttg gtg gaa att gaa gat ctt ccc gee tet cac ttc aga aac gtc ate 
Leu Val Glu He Glu Asp Leu Pro Ala Ser His Phe Arg Asn Val He 
1550 1555 1560 



4700 



ttt gac ate aeg ecg gga gat gag gca gga aag ttt gaa gta aat gee 
Phe Asp He Thr Pro Gly Asp Glu Ala Gly Lys Phe Glu Val Asn Ala 
1565 1570 1575 



4748 



aag ttc ctg ggt gtg gac atg gag cga ttt cag ctt cac tat cag gat 
Lys Phe Leu Gly Val Asp Met Glu Arg Phe Gin Leu His Tyr Gin Asp 
1580 1585 1590 1595 



4796 



etc ctg cag etc cag tat gag ggt gtg get gtc atg aaa etc ttc aac 
Leu Leu Gin Leu Gin Tyr Glu Gly Val Ala Val Met Lys Leu Phe Asn 
1600 1605 1610 



4844 



aag gee aaa gtc aat gtc aac ctt etc ate ttc etc etc aac aag aag 
Lys Ala Lys Val Asn Val Asn Leu Leu He Phe Leu Leu Asn Lys Lys 
1615 1620 1625 



4892 



ttt ttg egg aag 
Phe Leu Arg Lys 
1630 



tgacag aggcaaaggg tgctacccaa geccctctta 



4940 



cetctctgga tgctttcttt aacactaact caecaetgtg ettccctgea gaeacccaga 



5000 
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gctcaggact gggcaaggcc cagggattct caccccttcc ccagctggga ggagcttgcc 
tgcctggcca cagacagtgt atcttctaat tggctaaagt gggccttgcc cagagtccag 
ctgtgtggct tttatcatgc atgacaaacc cctggctttc ctgccagatg gattctcatc 
ccttacagct gactcttcca ggcaatttcc atagatctgc agtcctgcct ctgccacagt 
ctctctgttg tccccacatc tacccaactt cctgtactgt tgcccttctg atgttaataa 
aagcagctgt tactccc 



5060 
5120 
5180 
5240 
5300 
5317 



<210> 4 

<211> 1631 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Glu Arg Arg Ala Ala Gly Pro Gly Trp Ala Ala Tyr Glu Arg Leu 
15 10 15 

Thr Ala Glu Glu Met Asp Glu Gin Arg Arg Gin Asn Val Ala Tyr Gin 
20 25 30 

Tyr Leu Cys Arg Leu Glu Glu Ala Lys Arg Trp Met Glu Ala Cys Leu 
35 40 45 

Lys Glu Glu Leu Pro Ser Pro Val Glu Leu Glu Glu Ser Leu Arg Asn 
50 55 60 

Gly Val Leu Leu Ala Lys Leu Gly His Cys Phe Ala Pro Ser Val Val 
65 70 75 80 

Pro Leu Lys Lys He Tyr Asp Val Glu Gin Leu Arg Tyr Gin Ala Thr 
85 90 95 

Gly Leu His Phe Arg His Thr Asp Asn He Asn Phe Trp Leu Ser Ala 
100 105 110 
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He Ala His lie Gly Leu Pro Ser Thr Phe Phe Pro Glu Thr Thr Asp 
115 120 125 

He Tyr Asp Lys Lys Asn Met Pro Arg Val val Tyr Cys He His Ala 
130 135 140 

Leu Ser Leu Phe Leu Phe Arg Leu Gly Leu Ala Pro Gin He His Asp 
145 150 155 160 

Leu Tyr Gly Lys Val Lys Phe Thr Ala Glu Glu Leu Ser Asn Met Ala 
165 170 175 

Ser Glu Leu Ala Lys Tyr Gly Leu Gin Leu Pro Ala Phe Ser Lys He 
180 185 190 

Gly Gly He Leu Ala Asn Glu Leu Ser Val Asp Glu Ala Ala Val His 
195 200 205 

Ala Ala Val Leu Ala He Asn Glu Ala Val Glu Arg Gly Val Val Glu 
210 215 220 

Asp Thr Leu Ala Ala Leu Gin Asn Pro Ser Ala Leu Leu Glu Asn Leu 
225 230 , 235 240 

Arg Glu Pro Leu Ala Ala Val Tyr Gin Glu Met Leu Ala Gin Ala Lys 
245 250 255 • 

Met Glu Lys Ala Ala Asn Ala Arg Asn His Asp Asp Arg Glu Ser Gin 
260 265 270 

Asp He Tyr Asp His Tyr Leu Thr Gin Ala Glu He Gin Gly Asn He 
275 280 285 

Asn His val Asn Val His Gly Ala Leu Glu Val Val Asp Asp Ala Leu 
290 295 300 



Glu Arg Gin Ser Pro Glu Ala Leu Leu Lys Ala Leu Gin Asp Pro Ala 
305 310 315 320 
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Leu Ala X>eu Arg Gly Val Arg Arg Asp Phe Ala Asp Trp Tyr Leu Glu 
325 330 335 

Gin Leu Asn Ser Asp Arg Glu Gin Lys Ala Gin Glu Leu Gly Leu Val 
340 345 350 

Glu Leu Leu Glu Lys Glu Glu Val Gin Ala Gly Val Ala Ala Ala Asn 
355 360 365 

Thr Lys Gly Asp Gin Glu Gin Ala Met Leu His Ala Val Gin Arg lie 
370 375 380 

Asn Lys Ala lie Arg Arg Gly Val Ala Ala Asp Thr Val Lys Glu Leu 
385 390 395 400 

Met Cys Pro Glu Ala Gin Leu Pro Pro Val Tyr Pro Val Ala Ser Ser 
405 410 415 

Met Tyr Gin Leu Glu Leu Ala Val Leu Gin Gin Gin Gin Gly Glu Leu 
420 425 430 

Gly Gin Glu Glu Leu Phe Val Ala Val Glu Met Leu Ser Ala Val Val 
435 440 . 445 

Leu lie Asn Arg Ala Leu Glu Ala Arg Asp Ala Ser Gly Phe Trp Ser 
450 455 460 

Ser Leu Val Asn Pro Ala Thr Gly Leu Ala Glu Val Glu Gly Glu Asn 
465 470 475 480 

Ala Gin Arg Tyr Phe Asp Ala Leu Leu Lys Leu Arg Gin Glu Arg Gly 
485 490 495 

Met Gly Glu Asp Phe Leu Ser Trp Asn Asp Leu Gin Ala Thr Val Ser 
500 505 510 

Gin Val Asn Ala Gin Thr Gin Glu Glu Thr Asp Arg Val Leu Ala Val 
515 520 525 

Ser Leu He Asn Glu Ala Leu Asp Lys Gly Ser Pro Glu Lys Thr Leu 
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530 535 540 

Ser Ala Leu Leu Leu Pro Ala Ala Gly Leu Asp Asp Val Ser Leu Pro 
545 550 555 560 

val Ala Pro Arg Tyr His Leu Leu Leu Val Ala Ala Lys Arg Gin Lys 
565 570 575 

Ala Gin Val Thr Gly Asp Pro Gly Ala Val Leu Trp Leu Glu Glu lie 
580 585 590 

Arg Gin Gly Val Val Arg Ala Asn Gin Asp Thr Asn Thr Ala Gin Arg 
595 600 605 

Met Ala Leu Gly val Ala Ala He Asn Gin Ala He Lys Glu Gly Lys 
610 615 620 

Ala Ala Gin Thr Glu Arg Val Leu Arg Asn Pro Ala Val Ala Leu Arg 
625 630 635 640 

Gly Val Val Pro Asp Cys Ala Asn Gly Tyr Gin Arg Ala Leu Glu Ser 
645 650 655 

Ala Met Ala Lys Lys Gin Arg Pro Ala Asp Thr Ala Phe Trp Val Gin 
660 665 670 

His Asp Met Lys Asp Gly Thr Ala Tyr Tyr Phe His Leu Gin Thr Phe 
675 680 685 

Gin Gly He Trp Glu Gin Pro Pro Gly Cys Pro Leu Asn Thr Ser His 
690 695 700 

Leu Thr Arg Glu Glu He Gin Ser Ala Val Thr Lys Val Thr Ala Ala 
705 710 715 720 

Tyr Asp Arg Gin Gin Leu Trp Lys Ala Asn Val Gly Phe Val He Gin 
725 730 735 

Leu Gin Ala Arg Leu Arg Gly Phe Leu Val Arg Gin Lys Phe Ala Glu 
740 745 750 
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His Ser His Phe Leu Arg Thr Trp Leu Pro Ala Val He Lys He Gin 
755 760 765 

Ala His Trp Arg Gly Tyr Arg Gin Arg Lys He Tyr Leu Glu Trp Leu 
770 775 780 

Gin Tyr Phe Lys Ala Asn Leu Asp Ala He He Lys He Gin Ala Trp 
785 790 795 800 

Ala Arg Met Trp Ala Ala Arg Arg Gin Tyr Leu Arg Arg Leu His Tyr 
805 810 815 

Phe Gin Lys Asn Val Asn Ser He Val Lys He Gin Ala Phe Phe Arg 
820 825 830 

Ala Arg Lys Ala Gin Asp Asp Tyr Arg He Leu Val His Ala Pro His 
835 840 845 

Pro Pro Leu Ser Val Val Arg Arg Phe Ala His Leu Leu Asn Gin Ser 
850 855 860 

Gin Gin Asp Phe Leu Ala Glu Ala Glu Leu Leu Lys Leu Gin Glu Glu 
865 870 875 880 

Val Val Arg Lys He Arg Ser Asn Gin Gin Leu Glu Gin Asp Leu Asn 
885 890 895 

He Met Asp He Lys He Gly Leu Leu Val Lys Asn Arg He Thr Leu 
900 905 910 

Gin Glu Val Val Ser His Cys Lys Lys Leu Thr Lys Arg Asn Lys Glu 
915 920 925 

Gin Leu Ser Asp Met Met Val Leu Asp Lys Gin Lys Gly Leu Lys Ser 
930 935 940 

Leu Ser Lys Glu Lys Arg Gin Lys Leu Glu Ala Tyr Gin His Leu Phe 
945 950 955 960 
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Tyr Leu Leu Gin Thr Gin Pro lie Tyr Leu Ala Lys Leu lie Phe Gin 
965 970 975 

Met Pro Gin Asn Lys Thr Thr Lys Phe Met Glu Ala Val He Phe Ser 
980 985 990 

Leu Tyr Asn Tyr Ala Ser Ser Arg Arg Glu Ala Tyr Leu Leu Leu Gin 
995 1000 1005 

Leu Phe Lys Thr Ala Leu Gin Glu Glu He Lys Ser Lys Val Glu Gin 
1010 1015 1020 

Pro Gin Asp Val Val Thr Gly Asn Pro Thr Val Val Arg Leu Val Val 
1025 1030 1035 1040 

Arg Phe Tyr Arg Asn Gly Arg Gly Gin Ser Ala Leu Gin Glu He Leu 
1045 1050 1055 

Gly Lys val He Gin Asp Val Leu Glu Asp Lys Val Leu Ser Val His 
1060 1065 1070 

Thr Asp Pro Val His Leu Tyr Lys Asn Trp He Asn Gin Thr Glu Ala 
1075 1080 1085 

Gin Thr Gly Gin Arg Ser His Leu Pro Tyr Asp Val Thr Pro Glu Gin 
1090 1095 1100 

Ala Leu Ser His Pro Glu Val Gin Arg Arg Leu Asp He Ala Leu Arg 
1105 1110 1H5 1120 

Asn Leu Leu Ala Met Thr Asp Lys Phe Leu Leu Ala He Thr Ser Ser 
1125 1130 1135 

val Asp Gin He Pro Tyr Gly Met Arg Tyr Val Ala Lys Val Leu Lys 
1140 1145 1150 

Ala Thr Leu Ala Glu Lys Phe Pro Asp Ala Thr Asp Ser Glu Val Tyr 
1155 1160 1165 

Lys Val Val Gly Asn Leu Leu Tyr Tyr Arg Phe Leu Asn Pro Ala Val 
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1170 11*75 1180 

Val Ala Pro Asp Ala Phe Asp lie Val Ala Met Ala Ala Gly Gly Ala 
1185 1190 1195 1200 

Leu Ala Ala Pro Gin Arg His Ala Leu Gly Ala Val Ala Gin Leu Leu 
1205 1210 1215 

Gin His Ala Ala Ala Gly Lys Ala Phe Ser Gly Gin Ser Gin His Leu 
1220 1225 1230 

Arg Val Leu Asn Asp Tyr Leu Glu Glu Thr His Leu Lys Phe Arg Lys 
1235 1240 1245 

Phe lie His Arg Ala Cys Gin Val Pro Glu Pro Glu Glu Arg Phe Ala 
1250 1255 1260 

Val Asp Glu Tyr Ser Asp Met Val Ala Val Ala Lys Pro Met Val Tyr 
1265 1270 1275 1280 

lie Thr Val Gly Glu Leu Val Asn Thr His Arg Leu Leu Leu Glu His 
1285 1290 1295 

Gin Asp Cys He Ala Pro Asp His Gin Asp Pro Leu His Glu Leu Leu 
1300 1305 1310 

Glu Asp Leu Gly Glu Leu Pro Thr He Pro Asp Leu lie Gly Glu Ser 
1315 1320 1325 

He Ala Ala Asp Gly His Thr Asp Leu Ser Lys Leu Glu Val Ser Leu 
1330 1335 1340 

Thr Leu Thr Asn Lys Phe Glu Gly Leu Glu Ala Asp Ala Asp Asp Ser 
1345 1350 1355 1360 

Asn Thr Arg Ser Leu Leu Leu Ser Thr Lys Gin Leu Leu Ala Asp He 
1365 1370 1375 

He Gin Phe His Pro Gly Asp Thr Leu Lys Glu He Leu Ser Leu Ser 
1380 1385 1390 
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Ala Ser Arg Glu Gin Glu Ala Ala His Lys Gin Leu Met Ser Arg Arg 
1395 1400 1405 

Gin Ala Cys Thr Ala Gin Thr Pro Glu Pro Leu Arg Arg His Arg Ser 
1410 1415 1420 

Leu Thr Ala His Ser Leu Leu Pro Leu Ala Glu Lys Gin Arg Arg Val 
1425 1430 1435 1440 

Leu Arg Asn Leu Arg Arg Leu Glu Ala Leu Gly Leu Val Ser Ala Arg 
1445 1450 1455 

Asn Gly Tyr Gin Gly Leu Val Asp Glu Leu Ala Lys Asp lie Arg Asn 
1460 1465 1470 

Gin His Arg His Arg His Arg Arg Lys Ala Glu Leu Val Lys Leu Gin 
1475 1480 1485 

Ala Thr Leu Gin Gly Leu Ser Thr Lys Thr Thr Phe Tyr Glu Glu Gin 
1490 1495 1500 

Gly Asp Tyr Tyr Ser Gin Tyr lie Arg Ala Cys Leu Asp His Leu Ala 
1505 1510 1515 1520 

Pro Asp Ser Lys Ser Ser Gly Lys Gly Lys Lys Gin Pro Ser Leu His 
1525 1530 1535 

Tyr Thr Ala Ala Gin Leu Leu Glu Lys Gly Val Leu Val Glu lie Glu 
1540 1545 1550 

Asp Leu Pro Ala Ser His Phe Arg Asn Val lie Phe Asp lie Thr Pro 
1555 1560 1565 

Gly Asp Glu Ala Gly Lys Phe Glu Val Asn Ala Lys Phe Leu Gly Val 
1570 1575 1580 

Asp Met Glu Arg Phe Gin Leu His Tyr Gin Asp Leu Leu Gin Leu Gin 
1585 1590 1595 1600 
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Tyr Glu Gly Val Ala Val Met Lys Leu Phe Asn Lys Ala Lys Val Asn 
1605 1610 1615 



Val Asn Leu Leu lie Phe Leu Leu Asn Lys Lys Phe Leu Arg Lys 



1620 



1625 



1630 



<210> 5 

<211> 3608 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (424) . . (1908) 

<223> Clone LBFLllO 

<400> 5 

ggcggcgaga gcagctgcag ctcgcatctc aggcagtacc tagaggagct gccggtgcct 60 

cctcagaaca tctcctgatc gctacccagg accaggcacc aaggacaggg agtcccaggc 120 

gcacaccccc cattctgggt cccccaggcc cagaccccca ctctgccaca ggttgcatct 180 

tgacctggtc ctcctgcaga agtggcccct gtggtcctgc tctgagactc gtccctgggc 240 

gcccctgcag cccctttcta tgactccatc tggatttggc tggctgtggg gacgcggtcc 300 

gaggggcggc ctggctctca gcgtggtggc agccagctct ctggccacca tggcaaatgc 360 

tgagatctga ggggacaagg ctctacagcc tcagccaggg gcactcagct gttgcagggt 420 

gtg atg gag aac aaa get atg tac eta cac acc gtc age gac tgt 465 



Met Glu Asn Lys Ala Met Tyr Leu His Thr Val Ser Asp Cys 
15 10 



gac acc age tec ate tgt gag gat tec ttt gat ggc agg age ctg tec 



513 
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Asp Thr Ser Ser lie Cys Glu Asp Ser Phe Asp Gly Arg Ser Leu Ser 
15 20 25 30 

aag ctg aac ctg tgt gag gat ggt cca tgt cac aaa egg egg gca age 561 
Lys Leu Asn Leu Cys Glu Asp Gly Pro Cys His Lys Arg Arg Ala Ser 
35 40 45 

ate tgc tgt ace cag ctg ggg tec ctg teg gee ctg aag eat get gte 609 
lie Cys Cys Thr Gin Leu Gly Ser Leu Ser Ala Leu Lys His Ala Val 
50 55 60 

ctg ggg etc tac ctg ctg gte ttc ctg att ctt gtg ggc ate tte ate 657 
Leu Gly Leu Tyr Leu Leu Val Phe Leu lie Leu Val Gly lie Phe lie 
65 70 75 

tta gca gtg tec agg ccg cgc age tec cct gac gac ctg aag gee ctg 705 
Leu Ala Val Ser Arg Pro Arg Ser Ser Pro Asp Asp Leu Lys Ala Leu 
80 85 90 

act cgc aat gtg aac egg ctg aat gag age ttc egg gac ttg cag ctg 753 
Thr Arg Asn Val Asn Arg Leu Asn Glu Ser Phe Arg Asp Leu Gin Leu 
95 100 105 110 

egg ctg ctg cag get ccg ctg caa gcg gac ctg acg gag cag gtg tgg 801 
Arg Leu Leu Gin Ala Pro Leu Gin Ala Asp Leu Thr Glu Gin Val Trp 
115 120 125 

aag gtg cag gac gcg ctg cag aac eag tea gac teg ttg ctg gcg ctg 849 
Lys Val Gin Asp Ala Leu Gin Asn Gin Ser Asp Ser Leu Leu Ala Leu 
130 135 140 

gcg ggc gca gtg cag egg ctg gag ggc gcg ctg tgg ggg ctg cag gcg 897 
Ala Gly Ala Val Gin Arg Leu Glu Gly Ala Leu Trp Gly Leu Gin Ala 
145 150 155 

cag gcg gtg cag ace gag eag gcg gtg gee ctg ctg egg gac cgc aeg 945 
Gin Ala val Gin Thr Glu Gin Ala Val Ala Leu Leu Arg Asp Arg Thr 
160 165 170 

ggc cag cag age gac acg gcg cag ctg gag etc tac cag ctg cag gtg 993 



-39- 



wo 2004/035789 



Sequence Listing 



Gly Gin Gin Ser Asp Thr Ala Gin Leu Glu Leu Tyr Gin Leu Gin Val 
175 180 185 190 



PCT/KR2003/002161 



gag age aac agt age eag ctg ctg ctg agg cgc cae gcg gge etg ctg 
Glu Ser Asn Ser Ser Gin Leu Leu Leu Arg Arg His Ala Gly Leu Leu 
195 200 205 



1041 



gae ggg ctg gcg cgc agg gtg ggc ate ctg ggc gag gag ctg gcc gac 
Asp Gly Leu Ala Arg Arg Val Gly lie Leu Gly Glu Glu Leu Ala Asp 
210 215 220 



1089 



gtg ggc ggc gtg ctg cgc ggc etc aac cae age etg tec tac gac gtg 
Val Gly Gly Val Leu Arg Gly Leu Asn His Ser Leu Ser Tyr Asp Val 
225 230 235 



1137 



gee etc cae cgc acg egg ctg cag gac ctg egg gtg ctg gtg age aac 
Ala Leu His Arg Thr Arg Leu Gin Asp Leu Arg Val Leu Val Ser Asn 
240 245 250 



1185 



gee age gag gae acg cgc cgc ctg cgc etg gcg cac gta ggc atg gag 
Ala Ser Glu Asp Thr Arg Arg Leu Arg Leu Ala His Val Gly Met Glu 
255 260 265 270 



1233 



ctg cag ctg aag cag gag ctg gcc atg etc aac gcg gtc ace gag gac 
Leu Gin Leu Lys Gin Glu Leu Ala Met Leu Asn Ala Val Thr Glu Asp 
275 280 285 



1281 



ctg cgc etc aag gac tgg gag cac tec ate gca ctg egg aac ate tec 
Leu Arg Leu Lys Asp Trp Glu His Ser He Ala Leu TVrg Asn He Ser 
290 295 300 



1329 



etc gcg aaa ggg cea ccg gga ecc aaa ggt gat cag ggg cat gaa gga 
Leu Ala Lys Gly Pro Pro Gly Pro Lys Gly Asp Gin Gly His Glu Gly 
305 310 315 



1377 



aag gaa ggc agg cct ggc ate cct gga ttg cet gga ctt cga ggt ctg 
Lys Glu Gly Arg Pro Gly He Pro Gly Leu Pro Gly Leu Arg Gly Leu 
320 325 330 



1425 



cec ggg gag aga ggt acc cea gga ttg ccc ggg ccc aag ggc gat gat 



1473 
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Pro Gly Glu Arg Gly Thr Pro GXy Leu Pro Gly Pro Lys Gly Asp Asp 
335 340 345 350 
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ggg aag ctg ggg gcc aca gga cca atg ggc atg cgt ggg ttc aaa ggt 
Gly Lys Leu Gly Ala Thr Gly Pro Met Gly Met Arg Gly Phe Lys Gly 
355 360 365 



1521 



gac cga ggc cca aaa gga gag aaa gga gag aaa gga gac aga get ggg 
Asp Arg Gly Pro Lys Gly Glu Lys Gly Glu Lys Gly Asp Arg Ala Gly 
370 375 380 



1569 



gat gcc agt ggc gtg gag gcc ccg atg atg ate cgc ctg gtg aat ggc 
Asp Ala Ser Gly Val Glu Ala Pro Met Met lie Arg Leu Val Asn Gly 
385 390 395 



1617 



tea ggt ccg cac gag ggc cgc gtg gaa gtg tac cac gac egg cgt tgg 
Ser Gly Pro His Glu Gly Arg Val Glu Val Tyr His Asp Arg Arg Trp 
400 405 410 



1665 



ggc acc gtg tgt gac gac ggc tgg gac aag aag gac gga gac gtg gtg 
Gly Thr Val Cys Asp Asp Gly Trp Asp Lys Lys Asp Gly Asp Val Val 
415 420 425 430 



1713 



tgc cgc atg etc ggc ttc cgc ggt gtg gag gag gtg tac cgc aca get 
Cys Arg Met Leu Gly Phe Arg Gly Val Glu Glu Val Tyr Arg Thr Ala 
435 440 445 



1761 



cga ttc ggg caa ggc act ggg agg ate tgg atg gat gac gtt gcc tgc 
Arg Phe Gly Gin Gly Thr Gly Arg He Trp Met Asp Asp Val Ala Cys 
450 455 460 



1809 



aag ggc aca gag gaa acc ate ttc cgc tgc age ttc tec aaa tgg ggg 
Lys Gly Thr Glu Glu Thr He Phe Arg Cys Ser Phe Ser Lys Trp Gly 
465 470 475 



1857 



gtg aca aac tgt gga cat gcc gaa gat gcc age gtg aca tgc aac aga 
Val Thr Asn Cys Gly His Ala Glu Asp Ala Ser Val Thr Cys Asn Arg 
480 485 490 



1905 



cac 



tg aaagtgggca gagcccaagt tcggggtcct gcacagagca cccttcctgc 



1960 
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His 
495 



atccctgggg tggggcacag ctcggggcca ccctgaccat gcctcgacca caccccgtcc 2020 

agcattctca gtcctcacac ctgcatccca ggaccgtggg ggccggtcgt catttccctc 2080 

ttgaacatgt gctccgaagt ataactctgg gacctactgc ccgtctctct cttccaccag 2140 

gttcctgcat gaggagccct gatcaactgg atcaccactt tgcccagcct ctgaacacca 2200 

tgcaccaggc ctcaatatcc cagttccctt tggcctttta gttacaggtg aat get gaga 2260 

atgtgtcaga gacaagtgca gcagcagcga tggttggtag tatagatcat ttactcttca 2320 

gacaattccc aaacctccat tagtccaaga gtttctacat cttcctcccc agcaagaggc 2380 

aacgtcaagt gatgaatttc ccccctttac tctgcctctg ctccccattt gctagtttga 2440 

ggaagtgaca tagaggagaa gccagctgta ggggcaagag ggaaatgcaa gtcacctgca 2500 

ggaatccagc tagatttgga gaagggaatg aaactaacat tgaatgacta ccatggcacg 2560 

ctaaatagta tcttgggtgc caaattcatg tatccactta gctgcattgg tccagggcat 2620 

gtcagtctgg atacagcctt acctccaggt agcacttaac tggtccattc acctagactg 2680 

caagtaagaa gacaaaatga ctgagaccgt gtgcccacct gaacttattg tctttacttg 2740 

gcctgagcta aaagcttggg tgcaggacct gtgtaactag aaagttgcct acttcagaac 2800 

ctccagggcg tgagtgcaag gtcaaacatg actggcttcc aggccgacca tcaatgtagg 2860 

aggagagctg atgtggaggg tgacatgggg gctgcccatg ttaaacctga gtccagtgct 2920 

ctggcattgg gcagtcacgg ttaaagccaa gtcatgtgtg tctcagctgt ttggaggtga 2980 

tgattttgca tcttccaagc ctcttcaggt gtgaatctgt ggtcaggaaa acacaagtcc 3040 

taatggaacc cttagggggg aaggaaatga agattcccta taacctctgg gggtggggag 3100 
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taggaataag gggcttgggc ctccataaat ctgcaatctg caccctcctc ctagagacag 3160 

ggagatcgtg ttctgctttt tacatgagga gcagaactgg gccatacaca tgttcaagaa 3220 

ctaggggagc tacctggtag caagtgagtg cagacccacc tcaccttggg ggaatctcaa 3280 

actcataggc ctcagataca cgatcacctg tcatatcagg tgagcactgg cctgcttggg 3340 

gagagacctg ggcccctcca ggtgtaggaa cagcaacact cctggctgac aactaagcca 3400 

atatggccct aggtcattct tgcttccaat atgcttgcca ctccttaaat gtcctaatga 3460 

tgagaaactc tctttctgac caattgctat gtttacataa cacgcatgta ctcatgcatc 3520 

ccttgccaga gcccatatat gtatgcatat ataaacatag cactttttac tacatagctc 3580 

agcacattgc aaggtttgca tttaagtt 3608 

<210> 6 

<211> 495 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Glu Asn Lys Ala Met Tyr Leu His Thr Val Ser Asp Cys Asp Thr 
15 10 15 

Ser Ser He Cys Glu Asp Ser Phe Asp Gly Arg Ser Leu Ser Lys Leu 
20 25 30 

Asn Leu Cys Glu Asp Gly Pro Cys His Lys Arg Arg Ala Ser He Cys 
35 40 45 

Cys Thr Gin Leu Gly Ser Leu Ser Ala Leu Lys His Ala Val Leu Gly 
50 55 60 

Leu Tyr Leu Leu Val Phe Leu He Leu Val Gly He Phe He Leu Ala 
65 70 75 80 
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Val Ser Arg Pro Arg Ser Ser Pro Asp Asp Leu Lys Ala Leu Thr Arg 
85 90 95 

Asn val Asn Arg Leu Asn Glu Ser Phe Arg Asp Leu Gin Leu Arg Leu 
100 105 110 

Leu Gin Ala Pro Leu Gin Ala Asp Leu Thr Glu Gin Val Trp Lys Val 
115 120 125 

Gin Asp Ala Leu Gin Asn Gin Ser Asp Ser Leu Leu Ala Leu Ala Gly 
130 135 140 

Ala Val Gin Arg Leu Glu Gly Ala Leu Trp Gly Leu Gin Ala Gin Ala 
145 150 155 160 

Val Gin Thr Glu Gin Ala Val Ala Leu Leu Arg Asp Arg Thr Gly Gin 
165 170 175 

Gin Ser Asp Thr Ala Gin Leu Glu Leu Tyr Gin Leu Gin Val Glu Ser 
180 185 190 

Asn Ser Ser Gin Leu Leu Leu Arg Arg His Ala Gly Leu Leu Asp Gly 
195 200 205 

Leu Ala Arg Arg Val Gly lie Leu Gly Glu Glu Leu Ala Asp Val Gly 
210 215 220 

Gly Val Leu Arg Gly Leu Asn His Ser Leu Ser Tyr Asp Val Ala Leu 
225 230 235 240 

His Arg Thr Arg Leu Gin Asp Leu Arg Val Leu Val Ser Asn Ala Ser 
245 250 255 

Glu Asp Thr Arg Arg Leu Arg Leu Ala His Val Gly Met Glu Leu Gin 
260 265 270 

Leu Lys Gin Glu Leu Ala Met Leu Asn Ala Val Thr Glu Asp Leu Arg 
275 280 285 
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Leu Lys Asp Trp Glu His Ser lie Ala Leu Arg Asn lie Ser Leu Ala 
290 295 300 

Lys Gly Pro Pro Gly Pro Lys Gly Asp Gin Gly His Glu Gly Lys Glu 
305 310 315 320 

Gly Arg Pro Gly lie Pro Gly Leu Pro Gly Leu Arg Gly Leu Pro Gly 
325 330 335 

Glu Arg Gly Thr Pro Gly Leu Pro Gly Pro Lys Gly Asp Asp Gly Lys 
340 345 350 

Leu Gly Ala Thr Gly Pro Met Gly Met Arg Gly Phe Lys Gly Asp Arg 
355 360 365 

Gly Pro Lys Gly Glu Lys Gly Glu Lys Gly Asp Arg Ala Gly Asp Ala 
370 375 380 

Ser Gly Val Glu Ala Pro Met Met lie Arg Leu Val Asn Gly Ser Gly 
385 390 395 400 

Pro His Glu Gly Arg Val Glu Val Tyr His Asp Arg Arg Trp Gly Thr 
405 410 415 

Val Cys Asp Asp Gly Trp Asp Lys Lys Asp Gly Asp Val Val Cys Arg 
420 425 430 

Met Leu Gly Phe Arg Gly Val Glu Glu Val Tyr Arg Thr Ala Arg Phe 
435 440 445 

Gly Gin Gly Thr Gly Arg lie Trp Met Asp Asp Val Ala Cys Lys Gly 
450 455 460 

Thr Glu Glu Thr He Phe Arg Cys Ser Phe Ser Lys Trp Gly Val Thr 
465 470 475 480 

Asn Cys Gly His Ala Glu Asp Ala Ser Val Thr Cys Asn Arg His 
485 490 495 
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<210> 7 

<211> 3162 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (405) . . (1835) 

<223> Clone LBFL123 

<400> 7 

tggccggggt gccccgcaaa gagagacaag ggagaaaaag acagcaggga aaaccccggg 60 

gaggagaagg cgaaagagaa ggtggagctc agagaagggg gccggctccc cagctccatg 120 

tggccgccgc cgctgcgggt ctgtgggggc agagggcggc ggctcccagg gcagcgcgta 180 

gcgggaccga ttgcctaata ctccggcagg ggccggggcc gcagctggct cggataaata 240 

gccgcccggc tggcccggag ctgcagggga gagcggcggc cgcgatcccc accacaccac 300 

cagcccggcc gcacggggca ctgagccggg tgctgagcac cggaggcccc gccgaggccg 360 

ggactcagga cctgcagaga aacgcctcct gattttgtct taca atg gaa ctt 413 

Met Glu Leu 
1 

aaa aag teg cct gac ggt gga tgg ggc tgg gtg att gtg ttt gtc tec 4 61 

Lys Lys Ser Pro Asp Gly Gly Trp Gly Trp Val lie Val Phe Val Ser 
5 10 15 

tte ctt act eag ttt ttg tgt tac gga tee eca eta get gtt gga gtc 509 
Phe Leu Thr Gin Phe Leu Cys Tyr Gly Ser Pro Leu Ala Val Gly Val 
20 25 30 35 

ctg tac ata gaa tgg etg gat gcc ttt ggt gaa gga aaa gga aaa aca 557 
Leu Tyr lie Glu Trp Leu Asp Ala Phe Gly Glu Gly Lys Gly Lys Thr 
40 45 50 



-46- 



wo 2004/035789 



Sequence Listing 



PCT/KR2003/002161 



gcc tgg gtt gga tec ctg gca agt gga gtt ggc ttg ctt gca agt cct 
Ala Trp Val Gly Ser Leu Ala Ser Gly Val Gly Leu Leu Ala Ser Pro 
55 60 65 



605 



gtc tgc agt etc tgt gtc tea tct ttt gga gea aga cet gtc aca ate 
Val Cys Ser Leu Cys Val Ser Ser Phe Gly Ala Arg Pro Val Thr He 
70 75 80 



653 



ttc agt ggc ttc atg gtg get gga ggc ctg atg ttg age agt ttt get 
Phe Ser Gly Phe Met Val Ala Gly Gly Leu Met Leu Ser Ser Phe Ala 
85 90 95 



701 



eec aat ate tac ttt ctg ttt ttt tec tat ggc att gtt gta ggt tea 
Pro Asn lie Tyr Phe Leu Phe Phe Ser Tyr Gly lie Val Val Gly Ser 
100 105 110 115 



749 



age gtt ggc ctt ttc ata tat get get ctg cag agg atg ctg gtt gag 
Ser Val Gly Leu Phe He Tyr Ala Ala Leu Gin Arg Met Leu Val Glu 
120 125 130 



797 



ttc tat gga ctg gat gga tgc ttg ctg att gtg ggt get tta get tta 
Phe Tyr Gly Leu Asp Gly Cys Leu Leu lie Val Gly Ala Leu Ala Leu 
135 140 145 



845 



aat ata tta gee tgt ggc agt ctg atg aga eec etc caa tct tct gat 
Asn He Leu Ala Cys Gly Ser Leu Met Arg Pro Leu Gin Ser Ser Asp 
150 155 160 



693 



tgt cet ttg cct aaa aaa ata get cca gaa gat eta eea gat aaa tac 
Cys Pro Leu Pro Lys Lys He Ala Pro Glu Asp Leu Pro Asp Lys Tyr 
165 170 175 



941 



tec att tac aat gaa aaa gga aag aat ctg gaa gaa aac ata aac att 
Ser He Tyr Asn Glu Lys Gly Lys Asn Leu Glu Glu Asn He Asn He 
180 185 190 195 



989 



ctt gac aag age tac agt agt gag gaa aaa tgc agg ate acg tta gcc 
Leu Asp Lys Ser Tyr Ser Ser Glu Glu Lys Cys Arg He Thr Leu Ala 
200 205 210 



1037 
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aat ggt gac tgg aaa caa gac age eta ett cat aaa aac cce aea gtg 
Asn Gly Asp Trp Lys Gin Asp Ser Leu Leu His Lys Asn Pro Thr Val 
215 220 225 



1085 



aea cac aea aaa gag cct gaa acg tae aaa aag aaa gtt gca gaa eag 
Thr His Thr Lys Glu Pro Glu Thr Tyr Lys Lys Lys Val Ala Glu Gin 
230 235 240 



1133 



aea tat ttt tgc aaa cag ett gee aag agg aag tgg eag tta tat aaa 
Thr Tyr Phe Cys Lys Gin Leu Ala Lys Arg Lys Trp Gin Leu Tyr Lys 
245 250 255 



1181 



aac tac tgt ggt gaa act gtg get ett ttt aaa aac aaa gta ttt tea 
Asn Tyr Cys Gly Glu Thr Val Ala Leu Phe Lys Asn Lys Val Phe Ser 
260 265 270 275 



1229 



gee ett ttc att get ate tta etc ttt gac ate gga ggg ttt eca cct 
Ala Leu Phe lie Ala lie Leu Leu Phe Asp lie Gly Gly Phe Pro Pro 
280 285 290 



1277 



tea tta ett atg gaa gat gta gca aga agt tea aac gtg aaa gaa gaa 
Ser Leu Leu Met Glu Asp Val Ala Arg Ser Ser Asn Val Lys Glu Glu 
295 300 305 



1325 



gag ttt att atg cea ett att tec att ata ggc att atg aea gca gtt 
Glu Phe He Met Pro Leu He Ser He He Gly He Met Thr Ala Val 
310 315 320 



1373 



ggt aaa ctg ett tta ggg ata ctg get gac ttc aag tgg att aat acc 
Gly Lys Leu Leu Leu Gly He I<eu Ala Asp Phe Lys Trp He Asn Thr 
325 330 335 



1421 



ttg tat ett tat gtt get ace tta ate ate atg ggc eta gee ttg tgt 
Leu Tyr Leu Tyr Val Ala Thr Leu He He Met Gly Leu Ala Leu Cys 
340 345 350 355 



1469 



gca att cea ttt gee aaa age tat gte aea ttg gcg ttg ett tct ggg 
Ala He Pro Phe Ala Lys Ser Tyr Val Thr Leu Ala Leu Leu Ser Gly 
360 365 370 



1517 
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ate eta ggg ttt ctt act ggt aat tgg tec ate ttt cca tat gtg ace 
He Leu GXy Phe Leu Thr Gly Asn Trp Ser He Ph'e Pro Tyr Val Thr 
375 380 385 



1565 



acg aag act gtg gga att gaa aaa tta gee cat gee tat ggg ata tta 
Thr Lys Thr Val Gly He Glu Lys Leu Ala His Ala Tyr Gly He Leu 
390 395 400 



1613 



atg ttc ttt get gga ctt gga aat age eta gga cca cec ate gtt ggt 
Met Phe Phe Ala Gly Leu Gly Asn Ser Leu Gly Pro Pro He Val Gly 
405 410 415 



1661 



tgg ttt tat gae tgg ace eag ace tat gat att gea ttt tat ttt agt 
Trp Phe Tyr Asp Trp Thr Gin Thr Tyr Asp He Ala Phe Tyr Phe Ser 
420 425 430 435 



1709 



ggc ttc tgc gte ctg ctg gga ggt ttt att ctg ctg ctg gea gee ttg 
Gly Phe Cys Val Leu Leu Gly Gly Phe He Leu Leu Leu Ala Ala Leu 
440 445 450 



1757 



cec tct tgg gat aca tgc aac aag caa etc ecc aag cca get cca aca 
Pro Ser Trp Asp Thr Cys Asn Lys Gin Leu Pro Lys Pro Ala Pro Thr 
455 460 465 



1805 



act ttc ttg tac aaa gtt gee tct aat gtt 
Thr Phe Leu Tyr Lys Val Ala Ser Asn Val 
470 475 



tagaa gaatattgga 



1850 



agacactatt tttgctattt tataccatat agcaacgata ttttaacaga tcteaagcaa 



1910 



attttctaga gtcaagacta ttttctcata gcaaaatttc acaatgactg actctgaatg 



1970 



aattattttt ttttttttat atateetatt ttttatgtag tgtatgegta geetetatet 



2030 



cgtatttttt tctatttctc ctecccacac catcaatggg actattctgt tttgctgtta 



ttcactagtt ettaacattg taaaaagttt gaceageetc agaaggcttt ctetgtgtaa 



2090 
2150 



agaagtataa tttctctgct gactccattt aatccactgc aaggcaccta gagagactgc 



2210 
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tcctatttta aaagtgatgc aagcatcatg ataagatatg tgtgaagccc actaggaaat 2270 

aaatcattct cttctctatg tttgacttgc tagtaaacag aagacttcaa gccagccagg 2330 

aaattaaagt ggcgactaaa acagccttaa gaattgcagt ggagcaaatt ggtcattttt 2390 

taaaaaaata tattttaacc tacagtcacc agttttcatt attctattta cctcactgaa 2450 

gtactcgcat gttgtttggt acccactgag caactgtttc agttcctaag gtatttgctg 2510 

agatgtgggt gaactccaaa tggagaagta gtcactgtag actttcttca tggttgacca 2570 

ctccaacctt gctcactttt gcttcttggc catccactca gctgatgttt cctgggaagt 2630 

gctaatttta cctgtttcca aattggaaac acatttctca atcattccgt tctggcaaat 2690 

gggaaacatc catttgcttt gggcacagtg gggatgggct gcaagttctt gcatatcctc 2750 

ccagtgaagc atttatttgc tactatcaga ttttaccact atcaaatata attcaagggc 2810 

agaattaaac gtgagtgtgt gtgtgtgtgt gtgtgtgtgt gctatgcatg ctctaagtct 2870 

gcatgggata tgggaatgga aaagggcaat aagaaattaa tacccttatg cagttgcatt 2930 

taaccttaag aaaaatgtcc ttgggataaa ctccaatgtt taatacattg attttttttc 2990 

taaagaaatg ggttttaaac tttggtatgc atcagaattc cctatagatc tttttgaaaa 3050 

tataggtacc tgggtatcac acatagaact tttaattctg ctggtgtagg ctgttgccca 3110 

aacatctata attttactga gctcttcaag tgattctgat aacacagcct gg 3162 

<210> 8 

<211> 477 

<212> PRT 

<213> Homo sapiens 

<400> 8 

Met Glu Leu Lys Lys Ser Pro Asp Gly Gly Trp Gly Trp Val lie Val 
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15 10 15 

Phe Val Ser Phe Leu Thr Gin Phe Leu Cys Tyr Gly Ser Pro Leu Ala 
20 25 30 

Val Gly Val Leu Tyr lie Glu Trp Leu Asp Ala Phe Gly Glu Gly Lys 
35 40 45 

Gly Lys Thr Ala Trp Val Gly Ser Leu Ala Ser Gly Val Gly Leu Leu 
50 55 60 

Ala Ser Pro Val Cys Ser Leu Cys Val Ser Ser Phe Gly Ala Arg Pro 
65 70 75 80 

Val Thr He Phe Ser Gly Phe Met Val Ala Gly Gly Leu Met Leu Ser 
85 90 95 

Ser Phe Ala Pro Asn He Tyr Phe Leu Phe Phe Ser Tyr Gly He Val 
100 105 110 

Val Gly Ser Ser Val Gly Leu Phe He Tyr Ala Ala Leu Gin Arg Met 
115 120 125 

Leu Val Glu Phe Tyr Gly Leu Asp Gly Cys Leu Leu He Val Gly Ala 
130 135 140 

Leu Ala Leu Asn He Leu Ala Cys Gly Ser Leu Met Arg Pro Leu Gin 
145 150 155 160 

Ser Ser Asp Cys Pro Leu Pro Lys Lys He Ala Pro Glu Asp Leu Pro 
165 170 175 

Asp Lys Tyr Ser He Tyr Asn Glu Lys Gly Lys Asn Leu Glu Glu Asn 
180 185 190 

He Asn He Leu Asp Lys Ser Tyr Ser Ser Glu Glu Lys Cys Arg He 
195 200 205 

Thr Leu Ala Asn Gly Asp Trp Lys Gin Asp Ser Leu Leu His Lys Asn 
210 215 220 
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Pro Thr Val Thr His Thr Lys Glu Pro Glu Thr Tyr Lys Lys Lys Val 
225 230 235 240 

Ala Glu Gin Thr Tyr Phe Cys Lys Gin Leu Ala Lys Arg Lys Trp Gin 
245 250 255 

Leu Tyr Lys Asn Tyr Cys Gly Glu Thr Val Ala Leu Phe Lys Asn Lys 
260 265 270 

Val Phe Ser Ala Leu Phe lie Ala He Leu Leu Phe Asp He Gly Gly 
275 280 285 

Phe Pro Pro Ser Leu Leu Met Glu Asp Val Ala Arg Ser Ser Asn Val 
290 295 300 

Lys Glu Glu Glu Phe He Met Pro Leu He Ser He He Gly He Met 
305 310 315 320 

Thr Ala Val Gly Lys Leu Leu Leu Gly He Leu Ala Asp Phe Lys Trp 
325 330 335 

He Asn Thr Leu Tyr Leu Tyr Val Ala Thr Leu He He Met Gly Leu 
340 345 350 

Ala Leu Cys Ala He Pro Phe Ala Lys Ser Tyr Val Thr Leu Ala Leu 
355 360 365 

Leu Ser Gly He Leu Gly Phe Leu Thr Gly Asn Trp Ser He Phe Pro 
370 375 380 

Tyr Val Thr Thr Lys Thr Val Gly He Glu Lys Leu Ala His Ala Tyr 
385 390 395 400 

Gly He Leu Met Phe Phe Ala Gly Leu Gly Asn Ser Leu Gly Pro Pro 
405 410 415 

He Val Gly Trp Phe Tyr Asp Trp Thr Gin Thr Tyr Asp He Ala Phe 
420 425 430 
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Tyr Phe Ser Gly Phe Cys Val Leu Leu Gly Gly Phe lie Leu Leu Leu 
435 440 445 

Ala A14 Leu Pro Ser Trp Asp Thr Cys Asn Lys Gin Leu Pro Lys Pro 
450 455 460 

Ala Pro Thr Thr Phe Leu Tyr Lys Val Ala Ser Asn Val 
465 470 475 



<210> 9 

<211> 4891 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (89) . . (1150) 

<223> Clone LBFL131 



<400> 9 

acgcctggtt cccgggaaga ctcgccagca ccagggggtg ggggagtgcg agctgaaagc 



60 



tgctggagag tgagcagccc tagcaggg atg gac atg atg ctg ttg gtg cag 112 

Met Asp Met Met Leu Leu Val Gin 
1 5 

ggt get tgt tgc teg aac cag tgg ctg gcg gcg gtg etc etc age ctg 160 
Gly Ala Cys Cys Ser Asn Gin Trp Leu Ala Ala Val Leu Leu Ser Leu 
10 15 20 

tgc tgc ctg eta eee tec tgc etc eeg get gga cag agt gtg gac ttc 208 
Cys Cys Leu Leu Pro Ser Cys Leu Pro Ala Gly Gin Ser Val Asp Phe 
25 30 35 40 

ccc tgg gcg gee gtg gac aac atg atg gtc aga aaa ggg gac acg gcg 256 
Pro Trp Ala Ala Val Asp Asn Met Met Val Arg Lys Gly Asp Thr Ala 
45 50 55 
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gtg ctt agg tgt tat ttg gaa gat gga get tea aag ggt gcc tgg ctg 304 
Val Leu Arg Cys Tyr Leu Glu Asp Gly Ala Ser Lys Gly Ala Trp Leu 
60 65 70 

aac egg tea agt att att ttt geg gga ggt gat aag tgg tea gtg gat 352 
Asn Arg Ser Ser He He Phe Ala Gly Gly Asp Lys Trp Ser Val Asp 
75 80 85 

cet cga gtt tea att tea aea ttg aat aaa agg gac tac age etc cag 400 
Pro Arg Val Ser He Ser Thr Leu Asn Lys Arg Asp Tyr Ser Leu Gin 
90 95 100 

ata cag aat gta gat gtg aea gat gat ggc cea tae acg tgt tct gtt 448 
He Gin Asn Val Asp Val Thr Asp Asp Gly Pro Tyr Thr Cys Ser Val 
105 110 115 120 

cag act caa eat aca ccc aga aca atg cag gtg cat eta act gtg caa 496 
Gin Thr Gin His Thr Pro Arg Thr Met Gin Val His Leu Thr Val Gin 
125 130 135 

gtt cet cet aag ata tat gae ate tea aat gat atg ace gte aat gaa 544 
Val Pro Pro Lys He Tyr Asp He Ser Asn Asp Met Thr Val Asn Glu 
140 145 150 

gga ace aae gte act ett act tgt ttg gcc act ggg aaa cea gag cet 592 
Gly Thr Asn Val Thr Leu Thr Cys Leu Ala Thr Gly Lys Pro Glu Pro 
155 160 165 

tec att tct tgg cga cac ate tec eca tea geg aaa cea ttt gaa aat 640 
Ser He Ser Trp Arg His He Ser Pro Ser Ala Lys Pro Phe Glu Asn 
170 175 180 

gga caa tat ttg gac att tat gga att aca agg gac cag get ggg gaa 688 
Gly Gin Tyr Leu Asp He Tyr Gly He Thr Arg Asp Gin Ala Gly Glu 
185 190 195 200 

tat gaa tgc agt geg gaa aat gat gtg tea ttc cea gat gtg agg aaa 736 
Tyr Glu Cys Ser Ala Glu Asn Asp Val Ser Phe Pro Asp Val Arg Lys 
205 210 215 
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gta aaa gtt gtt gtc aac ttt get cct act att cag gaa att aaa tct 
Val Lys Val VaX Val Asn Phe Ala Pro Thr lie Gin Glu lie Lys Ser 
220 225 230 



784 



ggc acc gtg acc ccc gga cgc agt ggc ctg ata aga tgt gaa ggt gca 
Gly Thr Val Thr Pro Gly Arg Ser Gly Leu lie Arg Cys Glu Gly Ala 
235 240 245 



832 



ggt gtg ccg cct cca gcc ttt gaa tgg tac aaa gga gag aag aag etc 
Gly Val Pro Pro Pro Ala Phe Glu Trp Tyr Lys Gly Glu Lys Lys Leu 
250 255 260 



880 



ttc aat ggc caa caa gga att att att caa aat ttt age aca aga tec 
Phe Asn Gly Gin Gin Gly lie lie lie Gin Asn Phe Ser Thr Arg Ser 
265 270 275 280 



928 



att etc act gtt acc aac gtg aca cag gag cac ttc ggc aat tat act 
lie Leu Thr Val Thr Asn Val Thr Gin Glu His Phe Gly Asn Tyr Thr 
285 290 295 



976 



tgt gtg get gee aac aag eta ggc aca acc aat gcg age ctg cct ctt 
Cys Val Ala Ala Asn Lys Leu Gly Thr Thr Asn Ala Ser Leu Pro Leu 
300 305 310 



1024 



aac cct cca agt aca gee cag tat gga att acc ggg age get gat gtt 
Asn Pro Pro Ser Thr Ala Gin Tyr Gly lie Thr Gly Ser Ala Asp Val 
315 320 325 



1072 



ctt ttc tec tgc tgg tac ctt gtg ttg aca ctg tee tet ttc acc age 
Leu Phe Ser Cys Trp Tyr Leu Val Leu Thr Leu Ser Ser Phe Thr Ser 
330 335 340 



1120 



ata ttc tac ctg aag aat gee att eta caa 
lie Phe Tyr Leu Lys Asn Ala lie Leu Gin 
345 350 



taaattcaaa gacccataaa 



1170 



aggcttttaa ggattetetg aaagtgctga tggctggate eaatetggta cagtttgtta 



aaagcagegt gggatataat eagcagtgct tacatgggga tgatcgcctt ctgtagaatt 



1230 
1290 
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gctcattatg taaatacttt aattctactc ttttttgatt agctacatta ccttgtgaag 1350 

cagtacacat tgtccttttt ttaagacgtg aaagctctga aattactttt agaggatatt 1410 

aattgtgatt tcatgtttgt aatctacaac ttttcaaaag cattcagtca tggtctgcta 1470 

ggttgcaggc tgtagtttac aaaaacgaat attgcagtga atatgtgatt ctttaaggct 1530 

gcaatacaag cattcagttc cctgtttcaa taagagtcaa tccacattta caaagatgca 1590 

tttttttctt ttttgataaa aaagcaaata atattgcctt cagattattt cttcaaaata 1650 

taacacatat ctagattttt ctgctcgcat gatattcagg tttcaggaat gagccttgta 1710 

atataactgg ctgtgcagct ctgcttctct ttcctgtaag ttcagcatgg gtgtgccttc 1770 

atacaataat atttttctct ttgtctccaa ctaatataaa atgttttgct aaatcttaca 1830 

atttgaaagt aaaaataaac cagagtgatc aagttaaacc atacactatc tctaagtaac 1890 

gaaggagcta ttggactgta aaaatctctt cctgcactga caatggggtt tgagaatttt 1950 

gccccacact aactcagttc ttgtgatgag agacaattta ataacagtat agtaaatata 2010 

ccatatgatt tctttagttg tagctaaatg ttagatccac cgtgggaaat tattcccttt 2070 

aaaatgacag cacagtccac tcaaaggatt gcctagcaat acagcatctt ttcctttcac 2130 

tagtccaagc caaaaatttt aagatgattt gtcagaaagg gcacaaagtc ctatcaccta 2190 

atattacaag agttggtaag cgctcatcat taattttatt ttgtggcagc taagttagta 2250 

tgacagaggc agtgctcctg tggacaggag cattttgcat attttccatc tgaaagtatc 2310 

actcagttga tagtctggaa tgcatgttat atattttaaa acttccaaaa tatattataa 2370 

caaacattct atatcggtat gtagcagacc aatctctaaa atagctaatt cttcaataaa 2430 

atctttctat atagccattt cagtgcaaac aagtaaaatc aaaaaagacc atcctttatt 2490 
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tttccttaca tgatatatgt aagatgcgat caaataaaga caaaacacca gtgatgagaa 2550 

tatcttaaga taagtaatta tcaaattatt gtgaatgtta aattatttct actataaaga 2610 

agcaaaacta catttttgaa ggaaaatgct gttactctaa cattaattta caggaatagt 2670 

ttgatggttt cactctttac taaagaaagg ccatcacctt gaaagccatt ttacaggttt 2730 

gatgaagtta ccaatttcag tacacctaaa tttctacaaa tagtcccctt ttacaagttg 2790 

taacaacaaa gaccctataa taaaattaga tacaagaaat tttgcagtgg ttatacatat 2850 

ttgagatatc tagtatgttg ccctagcagg gatggcttaa aaactgtgat tttttttctt 2910 

caagtaaaac ttagtcccaa agtacatcat aaatcaattt taactagaaa aatgaatctt 2970 

aaatgagggg acataagtat actctttcca caaaatggca ataataaggc ataaagctag 3030 
taaatctact aactgtaata aatgtatgac attattttga ttgatacatt aaaaaagagt 3090 
ttttagaaca aatatggcat ttaactttat tatttatttg cttttaagaa atattctttg 3150 
tggaattgtt gaataaacta taaaatatta ttttgtattg cagctttaaa gtggcacact 3210 
ccataataat ctacctacta gaaatagtgg tgctaccaca aaaaatgtta accatcagta 3270 
ccattgtttg ggagaaagaa acaggtcaag aatgcatatt attcagtgac cgctttccta 3330 
gagttaaaat acctcctctt tgtaaggttt gtaggtaaat tgaggtataa actatggatg 3390 
aaccaaataa ttagttcaaa gtgttgtcat gattccaaat ttgtggagtc tggtgttttt 3450 
accatagaat gtgacagaag tacagtcata gctcagtagc tatatgtatt tgcctttatg 3510 
ttagaagaga ctttcttgag tgacattttt aaatagagga ggtattcact atgtttttct 3570 
gtatcacagc agcattccta gtccttaggc cctcggacag agtgaaatca tgagtattta 3630 
tgagttcaat attgtcaaat aaggctacag tatttgcttt tttgtgtgaa tgtattgcat 3690 
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ataatgttca agtagatgat tttacattta tggacatata aaacgtctga ttaccccatt 3750 

ttatcagtcc tgactgtaca agattgttgc aatttcagaa tagcagtttt ataaattgat 3810 

ttatctttta atctataaca atttgtgtta gctgttcatt tcaggattat attttctaca 3870 

agttccactt gtgggactcc ttttgttgcc cctatttttt tttaaagaag gaagaaagaa 3930 

aaatgagtag cagtttaaaa atgagaatgg agagaaaaga aaaagaatga aaaggaaagg 3990 

cagtaaagag ggaaaaaaaa ggaaggatgg aaggaatgaa ggaaggaagg gaggaagggg 4050 

agaaggtagg aagaaagaaa ggatgagagg gaaggaagaa tcagagtatt agggtagtta 4110 

acttacacat ttgcattctt agttatactg caagtggtgt aactatgttt ttcaatgatc 4170 

gcatttgaaa cataagtcct attataccat taagttccta ttatgcagca attatataat 4230 

aaaaagtact gcccaagtta tagtaatgtg ggtgtttttg agacactaaa agatttgaga 4290 

gggagaattt caaacttaaa gccacttttg gggggtttat aacttaactg aaaaattaat 4350 

gcttcatcat aacatttaag ctatatctag aaagtagact ggagaactga gaaaattacc 4410 

caggtaattc agggaaaaaa aaaaaatata tatatatata aataccccta catttgaagt 4470 

cagaaaactc tgaaaaactg aattatcaaa gtcaatcatc tataatgatc aaatttactg 4530 

aacaattgtt aatttatcca ttgtgcttag ctttgtgaca cagccaaaag ttacctattt 4590 

aatcttttca ataaaaattg ttttttgaaa tccagaaatg atttaaaaag aggtcaggtt 4650 

tttaactatt tattgaagta tgtggatgtc cagtatttca atagatatga atatgaataa 4710 

atggtatgcc ttaagattct ttgaatatgt atttacttta aagactggaa aaagctcttc 4770 

ctgtctttta gtaaacatcc atatttcata acctgatgta aaatatgttg tactgtttcc 4830 

aataggggaa tataaactca gtttatcaat taaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4890 
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4891 



<210> 10 

<211> 354 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asp Met Met Leu Leu Val Gin Gly Ala Cys Cys Ser Asn Gin Trp 
15 10 15 

Leu Ala Ala Val Leu Leu Ser Leu Cys Cys Leu Leu Pro Ser Cys Leu 
2b 25 30 

Pro Ala Gly Gin Ser Val Asp Phe Pro Trp Ala Ala Val Asp Asn Met 
35 40 45 

Met Val Arg Lys Gly Asp Thr Ala Val Leu Arg Cys Tyr Leu Glu Asp 
50 55 60 

Gly Ala Ser Lys Gly Ala Trp Leu Asn Arg Ser Ser lie lie Phe Ala 
65 70 75 80 

Gly Gly Asp Lys Trp Ser Val Asp Pro Arg Val Ser lie Ser Thr Leu 
85 90 95 

Asn Lys Arg Asp Tyr Ser Leu Gin lie Gin Asn Val Asp Val Thr Asp 
100 105 110 

Asp Gly Pro Tyr Thr Cys Ser Val Gin Thr Gin His Thr Pro Arg Thr 
115 120 125 

Met Gin Val His Leu Thr Val Gin Val Pro Pro Lys He Tyr Asp He 
130 135 140 

Ser Asn Asp Met Thr Val Asn Glu Gly Thr Asn Val Thr Leu Thr Cys 
145 150 155 160 

Leu Ala Thr Gly Lys Pro Glu Pro Ser He Ser Trp Arg His He Ser 
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165 170 175 

Pro Ser Ala Lys Pro Phe Glu Asn Gly Gin Tyr Leu Asp lie Tyr Gly 
180 185 190 

lie Thr Arg Asp Gin Ala Gly Glu Tyr Glu Cys Ser Ala Glu Asn Asp 
195 200 205 

Val Ser Phe Pro Asp Val Arg Lys Val Lys Val Val Val Asn Phe Ala 
210 215 220 

Pro Thr lie Gin Glu He Lys Ser Gly Thr Val Thr Pro Gly Arg Ser 
225 230 235 240 

Gly Leu He Arg Cys Glu Gly Ala Gly Val Pro Pro Pro Ala Phe Glu 
245 250 255 

Trp Tyr Lys Gly Glu Lys Lys Leu Phe Asn Gly Gin Gin Gly He He 
260 265 270 

He Gin Asn Phe Ser Thr Arg Ser He Leu Thr Val Thr Asn Val Thr 
275 280 285 

Gin Glu His Phe Gly Asn Tyr Thr Cys Val Ala Ala Asn Lys Leu Gly 
290 295 300 

Thr Thr Asn Ala Ser Leu Pro Leu Asn Pro Pro Ser Thr Ala Gin Tyr 
305 310 315 320 

Gly He Thr Gly Ser Ala Asp Val Leu Phe Ser Cys Trp Tyr Leu Val 
325 330 335 

Leu Thr Leu Ser Ser Phe Thr Ser He Phe Tyr Leu Lys Asn Ala He 
340 345 350 

Leu Gin 



<210> 11 
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<211> 3098 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (223) (1569) 

<223> LBFL164 

<400> 11 

gtttatcgcg cacatctcgc ggcgaggagg agaggccgga agggcgcccc agccccaagg 60 

ctcctgcccc gcctgggcct ccggctttcg tttccccgca acgcttcgct ttcgtttccc 120 

gctggcgcct ggctccctcc gggtttcgtt tcccgccggc gcctggctcc cgccaggttt 180 

cgtttccgag gcggggccga gggcggcgtc gctgaggcgc cc atg gcc ttc 231 

Met Ala Phe 
1 

gcc cgc egg etc ctg cgc ggg cca ctg teg ggg ecg etg etc ggg egg 279 
Ala Arg Arg Leu Leu Arg Gly Pro Leu Ser Gly Pro Leu Leu Gly Arg 
5 10 15 

cgc ggg gtc tgc get ggg gcc atg get ecg ccg cgc cgc ttc gtc ctg 327 
Arg Gly Val Cys Ala Gly Ala Met Ala Pro Pro Arg Arg Phe Val Leu 
20 25 30 35 

gag ctt ccc gac tgc acc ctg get cac ttc gcc eta ggc gcc gac gcc 375 
Glu Leu Pro Asp Cys Thr Leu Ala His Phe Ala Leu Gly Ala Asp Ala 
40 45 50 

ccc ggc gac gca gac gcc ccc gac ccc cgc ctg gcg gcg ctg ctg ggg 423 
Pro Gly Asp Ala Asp Ala Pro Asp Pro Arg Leu Ala Ala Leu Leu Gly 
55 60 65 

ccc ccg gag cgc age tac teg ctg tgc gtg ccc gtg acc ecg gac gcc 471 
Pro Pro Glu Arg Ser Tyr Ser Leu Cys Val Pro Val Thr Pro Asp Ala 
70 75 80 
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ggc tgc ggg gcc egg gtc egg gcg geg egg etg cac cag egc etg etg 519 
Gly Cys GXy Ala Arg Val Arg Ala Ala Arg Leu His Gin Arg Leu Leu 
85 90 95 

cac cag etg egc cgc ggc ccc ttc cag egg tgc cag etg etc agg etg 567 
His Gin Leu Arg Arg Gly Pro Phe Gin Arg Cys Gin Leu Leu Arg Leu 
100 105 110 115 

etc tgc tac tgc ccg ggc ggc cag gcc ggc ggc gca cag caa ggc ttc 615 
Leu Cys Tyr Cys Pro Gly Gly Gin Ala Gly Gly Ala Gin Gin Gly Phe 
120 125 130 

etg etg cgc gac ccc etg gat gac cct gac ace egg caa gcg etg etc 663 
Leu Leu Arg Asp Pro Leu Asp Asp Pro Asp Thr Arg Gin Ala Leu Leu 
135 140 145 

gag etg etg ggc gcc tgc cag gag gca cea cgc ccg cac ttg ggc gag 711 
Glu Leu Leu Gly Ala Cys Gin Glu Ala Pro Arg Pro His Leu Gly Glu 
150 155 160 

ttc gag gcc gac ccg cgc ggc cag etg tgg cag cgc etc tgg gag gtg 759 
Phe Glu Ala Asp Pro Arg Gly Gin Leu Trp Gin Arg Leu Trp Glu Val 
165 170 175 

caa gac ggc agg egg etg cag gtg ggc tgc gca cag gtc gtg ccc gtc 807 
Gin Asp Gly Arg Arg Leu Gin Val Gly Cys Ala Gin Val Val Pro Val 
180 185 190 195 

ccg gag ccc ccg etg cac ccg gtg gtg eca gac ttg ccc agt tec gtg 855 
Pro Glu Pro Pro Leu His Pro Val Val Pro Asp Leu Pro Ser Ser Val 
200 205 210 

gtc ttc ccg gac egg gaa gee gee egg gee gtt ttg gag gag tgt ace 903 
Val Phe Pro Asp Arg Glu Ala Ala Arg Ala Val Leu Glu Glu Cys Thr 
215 220 225 

tec ttt att cct gaa gcc egg gca gtg ett gac etg gtc gac cag tgc 951 
Ser Phe lie Pro Glu Ala Arg Ala Val Leu Asp Leu Val Asp Gin Cys 
230 235 240 
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cca aaa cag ate cag aaa gga aag ttc cag gtt gtt gcc ate gaa gga 
Pro Lys Gin He Gin Lys Gly Lys Phe Gin Val Val Ala He Glu Gly 
245 250 255 



999 



ctg gat gcc acg ggt aaa aee acg gtg acc cag tea gtg gca gat tea 
Leu Asp Ala Thr Gly Lys Thr Thr Val Thr Gin Ser Val Ala Asp Ser 
260 265 270 275 



1047 



ett aag get gtc etc tta aag tea cca cec tct tgc att ggc cag tgg 
Leu Lys Ala Val Leu Leu Lys Ser Pro Pro Ser Cys He Gly Gin Trp 
280 285 290 



1095 



agg aag ate ttt gat gat gaa cca act ate att aga aga get ttt tae 
Arg Lys He Phe Asp Asp Glu Pro Thr He He Arg Arg Ala Phe Tyr 
295 300 305 



1143 



tct ttg ggc aat tat att gtg gcc tec gaa ata get aaa gaa tet gcc 
Ser Leu Gly Asn Tyr He Val Ala Ser Glu He Ala Lys Glu Ser Ala 
310 315 320 



1191 



aaa tct cct gtg att gta gae agg tac tgg cac age acg gcc acc tat 
Lys Ser Pro Val He Val Asp Arg Tyr Trp His Ser Thr Ala Thr Tyr 
325 330 335 



1239 



get ata gee act gag gtg agt ggg ggt etc eag cac ctg cee eea gee 
Ala He Ala Thr Glu Val Ser Gly Gly Leu Gin His Leu Pro Pro Ala 
340 345 350 355 



1287 



eat cac eet gtg tac eag tgg eea gag gae etg etc aaa cct gae ett 
His His Pro Val Tyr Gin Trp Pro Glu Asp Leu Leu Lys Pro Asp Leu 
360 365 370 



1335 



ate etg ctg etc act gtg agt cct gag gag agg ttg cag agg ctg cag 
He Leu Leu Leu Thr Val Ser Pro Glu Glu Arg Leu Gin Arg Leu Gin 
375 380 385 



1383 



ggc egg ggc atg gag aag aee agg gaa gaa gca gaa ett gag gcc aac 
Gly Arg Gly Met Glu Lys Thr Arg Glu Glu Ala Glu Leu Glu Ala Asn 
390 395 400 



1431 
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agt gtg ttt cgt caa aag gta gaa atg tec tac cag egg atg gag aat 1479 
Ser Val Phe Arg Gin Lys Val Glu Met Ser Tyr Gin Arg Met Glu Asn 
405 410 415 

cet ggc tgc eat gtg gtt gat gee age cec tec aga gaa agg gte etg 1527 
Pro Gly Cys His Val Val Asp Ala Ser Pro Ser Arg Glu Arg Val Leu 
420 425 430 435 

eag acg gta tta age eta ate cag aat agt ttt agt gaa ecg t 1570 

Gin Thr Val Leu Ser Leu lie Gin Asn Ser Phe Ser Glu Pro 
440 445 

agttactctg gccaggtgcc acgtctaact agattagatg ttgtttgaaa catetacatc 1630 

caccatttgt tatgcagtgt teceaaattt ctgttctaea agcatgttgt gtggeagaaa 1690 

actggagacc aggcatetta attttaette agccatcgta cectcttctg actgatggac 1750 

ccgtcatcac aaaggt;ceet eteatcatgt tccagtgaga .ggecagcgat tgctttctte 1810 

ctggcatagt aaacatttte ttggaaeata tgttteactt aatcactaee aaatatctgg 1870 

aagacctgtc ttacteagac ageaccaggt gtacagaagc agcagacaag atettccaga 1930 

teageaggga gaccccggag ectctgcttc tcctacactg geatgctgat gagatcgtga 1990 

eatgcccaca ttggettctt ecacatctgg ttgcactegt catgatgggc tcgctgcatc 2050 

teccteagtc ecaaatteta gagceaagtg ttcctgcaga ggetgtctat gtgtcctgge 2110 

tgcecaagga caeteetgea gagccatttt tgggtaagga aeacttaeaa agaaggcatt 2170 

gatcttgtgt ctgaggctca gagccctttt gataggettc tgagtcatat ataaagaeat 2230 

tcaagccaag atgetccaae tgcaaatata ccaaccttet ctgaattata ttttgettat 2290 

ttatatttct tttctttttt tetaaagtat ggctctgaat agaatgcaca ttttccattg 2350 

aaetggatgc atttcattta gccaatecag taatttattt atattaatet atacaatatg 2410 
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tttcctcagc ataggagcta tgattcatta attaaaagtg gagtcaaaac gctaaatgca 2470 

atgtttgttg tgtattttca ttacacaaac ttaatttgtc ttgttaaata agtacagtgg 2530 

atcttggagt gggatttctt ggtaaattat cttgcacttg aatgtctcat gattacatat 2590 

gaaatcgctt tgacatatct ttagacagaa aaaagtagct gagtgagggg gaaattatag 2650 

agctgtgtga ctttagggag taggttgaac caggtgatta cctaaaattc cttccagttc 2710 

aaaggcagat aaatctgtaa attattttat cctatctacc atttcttaag aagacattac 2770 

tccaaaataa ttaaatttaa ggctttatca ggtctgcata tagaatctta aattctaata 2830 

aagtttcatg ttaatgtcat aggattttta aaagagctat aggtaatttc tatataatat 2890 

gtgtatatta aaatgtaatt gatttcagtt gaaagtattt taaagctgat aaatagcatt 2950 

agggttcttt gcaatgtggt atctagctgt attattggtt ttatttactt taaacatttt 3010 

gaaaagctta tactggcagc ctagaaaaac aaacaattaa tgtatcttta tgtccctggc 3070 

acatgaataa actttgctgt ggtttact 3098 



<210> 12 

<211> 449 

<212> PRT 

<213> Homo sapiens 

<400> 12 

Met Ala Phe Ala Arg Arg Leu Leu Arg Gly Pro Leu Ser Gly Pro Leu 
15 10 15 

Leu Gly Arg Arg Gly Val Cys Ala Gly Ala Met Ala Pro Pro Arg Arg 
20 25 30 

Phe Val Leu Glu Leu Pro Asp Cys Thr Leu Ala His Phe Ala Leu Gly 
35 40 45 
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Ala Asp Ala Pro Gly Asp Ala Asp Ala Pro Asp Pro Arg Leu Ala Ala 
50 55 60 

Leu Leu Gly Pro Pro Glu Arg Ser Tyr Ser Leu Cys Val Pro Val Thr 
65 70 75 80 

Pro Asp Ala Gly Cys Gly Ala Arg Val Arg Ala Ala Arg Leu His Gin 
85 90 95 

Arg Leu Leu His Gin Leu Arg Arg Gly Pro Phe Gin Arg Cys Gin Leu 
100 105 110 

Leu Arg Leu Leu Cys Tyr Cys Pro Gly Gly Gin Ala Gly Gly Ala Gin 
115 120 125 

Gin Gly Phe Leu Leu Arg Asp Pro Leu Asp Asp Pro Asp Thr Arg Gin 
130 135 140 

Ala Leu Leu Glu Leu Leu Gly Ala Cys Gin Glu Ala Pro Arg Pro His 
145 150 155 160 

Leu Gly Glu Phe Glu Ala Asp Pro Arg Gly Gin Leu Trp Gin Arg Leu 
165 170 175 

Trp Glu Val Gin Asp Gly Arg Arg Leu Gin Val Gly Cys Ala Gin Val 
180 185 190 

Val Pro Val Pro Glu Pro Pro Leu His Pro Val Val Pro Asp Leu Pro 
195 200 205 

Ser Ser Val Val Phe Pro Asp Arg Glu Ala Ala Arg Ala Val Leu Glu 
210 215 220 

Glu Cys Thr Ser Phe lie Pro Glu Ala Arg Ala Val Leu Asp Leu Val 
225 230 235 240 

Asp Gin Cys Pro Lys Gin He Gin Lys Gly Lys Phe Gin Val Val Ala 
245 250 255 
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He Glu Gly Leu Asp Ala Thr Gly Lys Thr Thr VaX Thr Gin Ser Val 
260 265 270 

Ala Asp Ser Leu Lys Ala Val Leu Leu Lys Ser Pro Pro Ser Cys He 
275 280 285 

Gly Gin Trp Arg Lys lie Phe Asp Asp Glu Pro Thr He He Arg Arg 
290 295 300 

Ala Phe Tyr Ser Leu Gly Asn Tyr He Val Ala Ser Glu He Ala Lys 
305 310 315 320 

Glu Ser Ala Lys Ser Pro val He Val Asp Arg Tyr Trp His Ser Thr 
325 330 335 

Ala Thr Tyr Ala He Ala Thr Glu Val Ser Gly Gly Leu Gin His Leu 
340 345 350 

Pro Pro Ala His His Pro Val Tyr Gin Trp Pro Glu Asp Leu Leu Lys 
355 360 365 

Pro Asp Leu He Leu Leu Leu Thr Val Ser Pro Glu Glu Arg Leu Gin 
370 375 380 

Arg Leu Gin Gly Arg Gly Met Glu Lys Thr Arg Glu Glu Ala Glu Leu 
385 390 395 400 

Glu Ala Asn Ser Val Phe Arg Gin Lys Val Glu Met Ser Tyr Gin Arg 
405 410 415 

Met Glu Asn Pro Gly Cys His Val Val Asp Ala Ser Pro Ser Arg Glu 
420 425 430 

Arg Val Leu Gin Thr Val Leu Ser Leu He Gin Asn Ser Phe Ser Glu 
435 440 445 



Pro 
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<210> 13 

<211> 1893 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (418) . . (1392) 

<223> LBFL167 Clone #20 



<400> 13 

agtccagctg ccgttaggcg ctgggatagt cgcacgctgg atgcatctac gtccgccgag 



60 



cccctggggc gaagaggccg cgtccgcctt catttgtggc cggtgcttcg ccccctgacc 120 

cttcgccccc aaagaccagc tctaacgtga gcgcctcggc cgccctgccc cagcctcgta 180 

cacgccgcca gcctcgccca gccggtgtcc ggagaccctc gggccgtgtc catttgtggg 240 

caaagccagc ggggcaggct tggccagagt gcaccactcg gcgccgtccc aggcccgacg 300 

ctctgggcgc gcccggaacc ccaggttcgc ggcccgtgtt tccgaccggc ggagggggct 360 

cagcggcccg atcccacgga agcgcgctcg gaggggtggg acccggccgg accggag 417 

atg gcg ccg cca gcg ggc ggg gcg gcg gcg gcg gcc teg gac ttg ggc 4 65 

Met Ala Pro Pro Ala Gly Gly Ala Ala Ala Ala Ala Ser Asp Leu Gly 
15 10 15 

tec gcc gca gtg etc ttg get gtg cae gcc gcg gtg agg ccg ctg ggc 513 
Ser Ala Ala Val Leu Leu Ala Val His Ala Ala Val Arg Pro Leu Gly 
20 25 30 

gcc ggg cca gac gcc gag gca cag etg egg agg ctg eag ctg age gcg 561 
Ala Gly pro Asp Ala Glu Ala Gin Leu Arg Arg Leu Gin Leu Ser Ala 
35 40 45 



gac cct gag egg cct ggg cgc ttc egg etg gag ctg ctg ggc gcg gga 
Asp Pro Glu Arg Pro Gly Arg Phe Arg Leu Glu Leu Leu Gly Ala Gly 



609 
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50 



55 



60 



cct ggg gcg gtt aat ttg gag tgg ccc ctg gag tea gtt tec tae aec 
Pro Gly Ala Val Asn Leu Glu Trp Pro Leu Glu Ser Val Ser Tyr Thr 
65 70 75 80 



657 



ate cga ggc ccc acc cag cac gag eta cag cct cca cca gga ggg cct 
lie Arg Gly Pro Thr Gin His Glu Leu Gin Pro Pro Pro Gly Gly Pro 
85 90 95 



705 



gga acc etc age ctg cac ttc etc aac cct cag gaa get cag egg tgg 
Gly Thr Leu Ser Leu His Phe Leu Asn Pro Gin Glu Ala Gin Arg Trp 
100 105 110 



753 



gca gtc eta gtc cga ggt gee acc gtg gaa gga cag aat ggc age aag 
Ala Val Leu Val Arg Gly Ala Thr Val Glu Gly Gin Asn Gly Ser Lys 
115 120 125 



801 



age aac tea cca cca gee ttg ggc cca gaa gca tgc cct gtc tee ctg 
Ser Asn Ser Pro Pro Ala Leu Gly Pro Glu Ala Cys Pro Val Ser Leu 
130 135 140 



849 



ccc agt ccc ccg gaa gee tec aca etc aag ggc cct cca cct gag gca 
Pro Ser Pro Pro Glu Ala Ser Thr Leu Lys Gly Pro Pro Pro Glu Ala 
145 150 155 160 



897 



gat ett cct agg age cct gga aac ttg acg gag aga gaa gag ctg gca 
Asp Leu Pro Arg Ser Pro Gly Asn Leu Thr Glu Arg Glu Glu Leu Ala 
165 170 175 



945 



ggg age ctg gee egg get att gca ggt gga gae gag aag ggg gca gee 
Gly Ser Leu Ala Arg Ala lie Ala Gly Gly Asp Glu Lys Gly Ala Ala 
180 185 190 



993 



caa gtg gca gee gtc ctg gee cag cat cgt gtg gee ctg agt gtt cag 
Gin Val Ala Ala Val Leu Ala Gin His Arg Val Ala Leu Ser Val Gin 
195 200 205 



1041 



ett cag gag gee tgc ttc cca cct ggc ccc ate agg ctg cag gtc aca 
Leu Gin Glu Ala Cys Phe Pro Pro Gly Pro lie Arg Leu Gin Val Thr 



1089 
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210 215 220 

ctt gaa gac get gcc tct gcc gca tec gcc gcg tec tet gea cae gtt 1137 
Leu Glu Asp Ala Ala Ser Ala Ala Ser Ala Ala Ser Ser Ala His Val 
225 230 235 240 

gcc ctg cag gtc cae ccc cac tgc act gtt gca get etc cag gag cag 1185 
Ala Leu Gin Val His Pro His Cys Thr val Ala Ala Leu Gin Glu Gin 
245 250 255 

gtg ttc tea gag etc ggt ttc eeg cea gee gtg caa cgc tgg gtc ate 1233 
Val Phe Ser Glu Leu Gly Phe Pro Pro Ala Val Gin Arg Trp Val lie 
260 265 270 

gga egg tgc ctg tgt gtg cct gag cgc age ctt gcc tct tac ggg gtt 1281 
Gly Arg Cys Leu Cys Val Pro Glu Arg Ser Leu Ala Ser Tyr Gly Val 
275 280 285 

egg cag gat ggg gac cct get ttc etc tac ttg ctg tea get cct ega 1329 
Arg Gin Asp Gly Asp Pro Ala Phe Leu Tyr Leu Leu Ser Ala Pro Arg 
290 295 300 

gaa gcc cea ggt cag tec teg atg ggg gtg ggg tgt ggg agg tgg ggt 1377 
Glu Ala Pro Gly Gin Ser Ser Met Gly Val Gly Cya Gly Arg Trp Gly 
305 310 315 320 

gca gcc cea cag tee tgagctce aceeeetcag ceacaggace tageccteag 1430 
Ala Ala Pro Gin Ser 
325 

eacccecaga agatggaegg ggaacttgga cgettgttte ecccateatt ggggctaccc 1490 

ceaggecccc agecagetge cteeagcetg eeeagteeae tecagcecag etggtectgt 1550 

ccttcctgea ecttcatcaa tgcceeagac cgccetgget gtgagatgtg tageaeccag 1610 

aggcectgea ettgggacce eettgctgca gettccacet ageagceace agaggtaeca 1670 

gaggtggcac aggcagggga ggtggggggc cagggcagaa teeacaggaa tgacceagct 1730 
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cctcccccac aggttacaag gggagagtgg cccttccctc acaagtccga catctccagg 1790 
cccccactga actccgggga cctctactga ctgcttgctg ggacagtcac cagggttggg 1850 
gggaagggcc acaaaatgaa accattaaag acccttaaga gcc 1893 



<210> 


14 


<211> 


325 


<212> 


PRT 


<213> 


Homo 


<400> 


14 



sapiens 



Met Ala Pro Pro Ala Gly Gly Ala Ala Ala Ala Ala Ser Asp Leu Gly 
15 10 15 

Ser Ala Ala Val Leu Leu Ala Val His Ala Ala Val Arg Pro Leu Gly 
20 25 30 

Ala Gly Pro Asp Ala Glu Ala Gin Leu Arg Arg Leu Gin Leu Ser Ala 
35 40 45 

Asp Pro Glu Arg Pro Gly Arg Phe Arg Leu Glu Leu Leu Gly Ala Gly 
50 55 60 

Pro Gly Ala Val Asn Leu Glu Trp Pro Leu Glu Ser Val Ser Tyr Thr 
65 70 75 80 

lie Arg Gly Pro Thr Gin His Glu Leu Gin Pro Pro Pro Gly Gly Pro 
85 90 95 

Gly Thr Leu Ser Leu His Phe Leu Asn Pro Gin Glu Ala Gin Arg Trp 
100 105 110 

Ala Val Leu Val Arg Gly Ala Thr Val Glu Gly Gin Asn Gly Ser Lys 
115 120 125 

Ser Asn Ser Pro Pro Ala Leu Gly Pro Glu Ala Cys Pro Val Ser Leu 
130 135 140 
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Pro Ser Pro Pro Glu Ala Ser Thr Leu Lys GXy Pro Pro Pro Glu Ala 
145 150 155 160 

Asp Leu Pro Arg Ser Pro Gly Asn Leu Thr Glu Arg Glu Glu Leu Ala 
165 170 175 

Gly Ser Leu Ala Arg Ala lie Ala Gly Gly Asp Glu Lys Gly Ala Ala 
180 185 190 

Gin Val Ala Ala val Leu Ala Gin His Arg Val Ala Leu Ser Val Gin 
195 200 205 

Leu Gin Glu Ala Cys Phe Pro Pro Gly Pro He Arg Leu Gin Val Thr 
210 215 220 

Leu Glu Asp Ala Ala Ser Ala Ala Ser Ala Ala Ser Ser Ala His Val 
225 230 235 240 

Ala Leu Gin Val His Pro His Cys Thr Val Ala Ala Leu Gin Glu Gin 
245 250 255 

Val Phe Ser Glu Leu Gly Phe Pro Pro Ala Val Gin Arg Trp Val He 
260 265 270 

Gly Arg Cys Leu Cys Val Pro Glu Arg Ser Leu Ala Ser Tyr Gly Val 
275 280 285 

Arg Gin Asp Gly Asp Pro Ala Phe Leu Tyr Leu Leu Ser Ala Pro Arg 
290 295 300 

Glu Ala Pro Gly Gin Ser Ser Met Gly Val Gly Cys Gly Arg Trp Gly 
305 310 315 320 

Ala Ala Pro Gin Ser 
325 



<210> 15 
<211> 1597 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> CDS 

<222> (271).. (1431) 

<223> LBFL167 Clone #46 



<400> 15 

gtgagcgcct cggccgccct gccccagcct cgtacacgcc gccagctcgc ccagccggtg 



60 



tccggagacc ctcgggccgt gtccatttgt gggcaaagcc agcggggcag gcttggccag 120 

agtgcaccac tcggcgccgt cccaggcccg acgctctggg cgcgcccgga accccaggtt 180 

cgcggcccgt gtttccgacc ggcggagggg gctcagcggc ccgatcccac ggaagcgcgc 240 

tcggaggggt gggacccggc cggaccggag atg gcg ccg cca gcg ggc ggg gcg 294 

Met Ala Pro Pro Ala Gly Gly Ala 
1 5 

gcg gcg gcg gcc teg gac ttg ggc tec gcc gca gtg etc ttg get gtg 342 
Ala Ala Ala Ala Ser Asp Leu Gly Ser Ala Ala Val Leu Leu Ala Val 
10 15 20 

cac gcc gcg gtg agg ccg ctg ggc gcc ggg cca gac gcc gag gca cag 390 
His Ala Ala Val Arg Pro Leu Gly Ala Gly Pro Asp Ala Glu Ala Gin 
25 30 35 40 

ctg egg agg ctg cag ctg age gcg gac cet gag agg cct ggg cgc ttc 
Leu Arg Arg Leu Gin Leu Ser Ala Asp Pro Glu Arg Pro Gly Arg Phe 
45 50 55 

egg ctg gag ctg ctg ggc gcg gga cct ggg gcg gtt aat ttg gag tgg 486 
Arg Leu Glu Leu Leu Gly Ala Gly Pro Gly Ala Val Asn Leu Glu Trp 
60 65 70 



cec ctg gag tea gtt tec tac ace ate ega ggc eee ace cag cac gag 
Pro Leu Glu Ser Val Ser Tyr Thr lie Arg Gly Pro Thr Gin His Glu 
75 80 85 



438 



534 
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eta cag cct cca cca gga ggg cct gga acc etc age ctg cac ttc etc 
Leu Gin Pro Pro Pro Gly Gly Pro Gly Thr Leu Ser Leu His Phe Leu 
90 95 100 



582 



aae cct cag gaa get cag egg tgg gea gtc eta gte cga ggt gee ace 
Asn Pro Gin Glu Ala Gin Arg Trp Ala Val Leu Val Arg Gly Ala Thr 
105 110 115 120 



630 



gtg gaa gga cag aat ggc age aag age aae tea cca cca gcc ttg ggc 
Val Glu Gly Gin Asn Gly Ser Lys Ser Asn Ser Pro Pro Ala Leu Gly 
125 130 135 



678 



cca gaa gca tgc cct gte tec ctg cec agt eee ceg gaa gee tec aca 
Pro Glu Ala Cys Pro Val Ser Leu Pro Ser Pro Pro Glu Ala Ser Thr 
140 145 150 



726 



etc aag ggc cct cca cct gag gca gat ctt cct agg age cct gga aac 
Leu Lys Gly Pro Pro Pro Glu Ala Asp Leu Pro Arg Ser Pro Gly Asn 
155 160 165 



774 



ttg acg gag aga gaa gag ctg gca ggg age ctg gcc egg get att gca 
Leu Thr Glu Arg Glu Glu Leu Ala Gly Ser Leu Ala Arg Ala lie Ala 
170 175 180 



822 



ggt gga gac gag aag ggg gea gee eaa gtg gea gcc gtc ctg gee cag 
Gly Gly Asp Glu Lys Gly Ala Ala Gin Val Ala Ala Val Leu Ala Gin 
185 190 195 200 



870 



cat egt gtg gee ctg agt gtt cag ctt cag gag gcc tgc ttc cca cct 
His Arg Val Ala Leu Ser Val Gin Leu Gin Glu Ala Cys Phe Pro Pro 
205 210 215 



918 



ggc eee ate agg ctg cag gte aca ctt gaa gac get gcc tct gcc gca 
Gly Pro lie Arg Leu Gin Val Thr Leu Glu Asp Ala Ala Ser Ala Ala 
220 225 230 



966 



tec gee gcg tee tct gca cac gtt gee ctg cag gtc cac cec cac tgc 
Ser Ala Ala Ser Ser Ala His Val Ala Leu Gin Val His Pro His Cys 
235 240 245 



1014 
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act gtt gca get etc cag gag cag gtg ttc tea gag etc ggt ttc ecg 
Thr Val Ala Ala Leu Gin Glu Gin Val Phe Ser Glu Leu Gly Phe Pro 
250 255 260 



1062 



cca gee gtg eaa ege tgg gtc ate gga egg tgc ctg tgt gtg ect gag 
Pro Ala Val Gin Arg Trp Val lie Gly Arg Cys Leu Cys Val Pro Glu 
265 270 275 280 



1110 



ege age ett gee tet tac ggg gtt egg cag gat ggg gae ect get ttc 
Arg Ser Leu Ala Ser Tyr Gly Val Arg Gin Asp Gly Asp Pro Ala Phe 
285 290 295 



1158 



etc tac ttg ctg tea get ect cga gaa gee cca gee aca gga ect age 
Leu Tyr Leu Leu Ser Ala Pro Arg Glu Ala Pro Ala Thr Gly Pro Ser 
300 305 310 



1206 



ect cag cac ecc cag aag atg gac ggg gaa ctt gga cgc ttg ttt ccc 
Pro Gin His Pro Gin Lys Met Asp Gly Glu Leu Gly Arg Leu Phe Pro 
315 320 325 



1254 



cca tea ttg ggg eta ccc cca ggc ecc cag cca get gee tee age ctg 
Pro Ser Leu Gly Leu Pro Pro Gly Pro Gin Pro Ala Ala Ser Ser Leu 
330 335 340 



1302 



ecc agt eea etc cag ccc age tgg tee tgt ect tee tgc ace ttc ate 
Pro Ser Pro Leu Gin Pro Ser Trp Ser Cys Pro Ser Cys Thr Phe He 
345 350 355 360 



1350 



aat gee cca gae ege ect ggc tgt gag atg tgt age ace cag agg ccc 
Asn Ala Pro Asp Arg Pro Gly Cys Glu Met Cys Ser Thr Gin Arg Pro 
365 370 375 



1398 



tgc act tgg gae ecc ctt get gca get tec ace tagcagcea ceagaggtta 
Cys Thr Trp Asp Pro Leu Ala Ala Ala Ser Thr 
380 385 



1450 



caaggggaga gtggcecttc cetcacaagt ecgaeatctc caggcceeca etgaaetceg 



1510 



gggacetcta ctgactgctt gctgggacag tcaecagggt tggggggaag ggecaeaaaa 



1570 
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tgaaaccatt aaagaccctt aagagcc 1597 

<210> 16 

<211> 387 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Ala Pro Pro Ala Gly Gly Ala Ala Ala Ala Ala Ser Asp Leu Gly 
15 10 15 

Ser Ala Ala Val Leu Leu Ala Val His Ala Ala Val Arg Pro Leu Gly 
20 25 30 

Ala Gly Pro Asp Ala Glu Ala Gin Leu Arg Arg Leu Gin Leu Ser Ala 
35 40 45 

Asp Pro Glu Arg Pro Gly Arg Phe Arg Leu Glu Leu Leu Gly Ala Gly 
50 55 60 

Pro Gly Ala Val Asn Leu Glu Trp Pro Leu Glu Ser Val Ser Tyr Thr 
65 70 75 80 

lie Arg Gly Pro Thr Gin His Glu Leu Gin Pro Pro Pro Gly Gly Pro 
85 90 95 

Gly Thr Leu Ser Leu His Phe Leu Asn Pro Gin Glu Ala Gin Arg Trp 
100 105 110 

Ala Val Leu Val Arg Gly Ala Thr Val Glu Gly Gin Asn Gly Ser Lys 
115 120 125 

Ser Asn Ser Pro Pro Ala Leu Gly Pro Glu Ala Cys Pro Val Ser Leu 
130 135 140 

Pro Ser Pro Pro Glu Ala Ser Thr Leu Lys Gly Pro Pro Pro Glu Ala 
145 150 155 160 
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Asp Leu Pro Arg Ser Pro Gly Asn Leu Thr Glu Arg Glu Glu Leu Ala 
165 170 175 

Gly Ser Leu Ala Arg Ala lie Ala Gly Gly Asp Glu Lys Gly Ala Ala 
180 185 190 

Gin Val Ala Ala Val Leu Ala Gin His Arg val Ala Leu Ser Val Gin 
195 200 205 

Leu Gin Glu Ala Cys Phe Pro Pro Gly Pro lie Arg Leu Gin Val Thr 
210 215 220 

Leu Glu Asp Ala Ala Ser Ala Ala Ser Ala Ala Ser Ser Ala His Val 
225 230 235 240 

Ala Leu Gin Val His Pro His Cys Thr Val Ala Ala Leu Gin Glu Gin 
245 250 255 

Val Phe Ser Glu Leu Gly Phe Pro Pro Ala Val Gin Arg Trp Val lie 
260 265 270 

Gly Arg Cys Leu Cys Val Pro Glu Arg Ser Leu Ala Ser Tyr Gly Val 
275 280 285 

Arg Gin Asp Gly Asp Pro Ala Phe Leu Tyr Leu Leu Ser Ala Pro Arg 
290 295 300 

Glu Ala Pro Ala Thr Gly Pro Ser Pro Gin His Pro Gin Lys Met Asp 
305 310 315 320 

Gly Glu Leu Gly Arg Leu Phe Pro Pro Ser Leu Gly Leu Pro Pro Gly 
325 330 335 

Pro Gin Pro Ala Ala Ser Ser Leu Pro Ser Pro Leu Gin Pro Ser Trp 
340 345 350 

Ser Cys Pro Ser Cys Thr Phe lie Asn Ala Pro Asp Arg Pro Gly Cys 
355 360 365 

Glu Met Cys Ser Thr Gin Arg Pro Cys Thr Trp Asp Pro Leu Ala Ala 
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370 



375 



380 



Ala Ser Thr 
385 



<210> 17 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 17 

gctgaagcag gaaaatcgct t 

<210> 18 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 18 

tgagacggag tctcactcgg t 

<210> 19 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 



19 



gtttttccta attttggcat gaac 



24 



<210> 20 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 20 

cgcccaagct tttcctttt 19 

<210> 21 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 21 

caccctttgc ctctgtcact tccgca 26 

<210> 22 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
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<400> 22 

gctggagcac caggactgca ttg 23 

<210> 23 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 23 

ggagctgagc agcagtgtaa tgaa 24 

<210> 24 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 24 

gaggcctgcc tgaaggagga gcttc 25 

<210> 25 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 25 
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tctggaagta gtgcagacgc ctcagg 



26 



<210> 26 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 26 

agccaacgtc ggctttgtta tccagc 26 

<210> 27 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 27 

gctgtcagat atgatggttc tggac 25 

<210> 28 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 28 

ccagcctcac cactgttggg ttgc 24 
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<210> 29 

<211> 23 

<212> DNA 

<2I3> Artificial Sequence 
<220> 

<223> Primer 

<400> 29 

cattctctga gctgtattag tgt 23 

<210> 30 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 30 

cctgagctgg aatgacctgc a 21 

<210> 31 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 31 

ctttgtgttg gctgcagcca ca 



22 
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<210> 32 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 32 

tgaggagaga ctttgctgac tggt 

<210> 33 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 33 
gtcctgtctg gcggtgccga 

<210> 34 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 



<400> 34 

gctccaggat cccctgtcac ctgggccttc tgccttttgg ct 
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<210> 35 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 35 

ccatatggag aggagagcag cgggccca 28 

<210> 36 

<211> 23 

<2X2> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 36 

gaaggaggaa catggagagg aga 23 

<210> 37 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 37 

ccatatgccc cgggtagtct actgcat 27 

<210> 38 
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<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 38 

gtcgactcga gtcacttccg caaaaacttc ttg 33 

<210> 39 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 39 

tccattccga aggctctcct cc 22 

<210> 40 

<211> 23 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 
<400> 40 

gtctgtgtga cggaaatgta age 23 

<210> 41 
<211> 23 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 41 

gaaggtcgaa ggcagaccga tgt 23 

<210> 42 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 42 

gaatgtgtca gagacaagtg cage 24 

<210> 43 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 43 

tgtagaaact cttggactaa tggagg 26 

<210> 44 
<211> 29 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 44 

gtatgcatca gaattcccta tagatcttt 29 

<210> 45 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 45 

tagatgtttg ggcaacagcc t 21 

<210> 46 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 46 

ttccttcacc aaaggcatcc agccattcta tg 32 

<210> 47 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
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<220> 



<223> 



Primer 



<400> 47 

aaatgtctga ttaccccatt ttatcagt 

<210> 48 

<2H> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 48 

taatcctgaa atgaacagct aaca 

<210> 49 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 49 

taatgttaga gtaacagcat tttccttcaa 

<210> 50 

<211> 29 
<212> DNA 

<213> Artificial Sequence 
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<220> 



<223> 



Primer 



<400> 



50 



tgccccacac taactcagtt cttgtgatg 



29 



<210> 51 

<211> 29 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Primer 

<400> 51 

aaggctttat caggtctgca tatagaatc 29 

<210> 52 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 52 

gcaaagaacc ctaatgctat ttatcagc 28 

<210> 53 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> 



Primer 



<400> 



53 



gagaagacca gggaagaagc ag 



22 



<210> 54 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 54 

ggacggggaa cttggacgc 19 

<210> 55 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 

<400> 55 

aagtgcaggg cctctgggtg 20 

<210> 56 

<2H> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
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<400> 56 
cgctgggtca tcggacggt 



19 



INTERNATIONAL SEARCH REPORT 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC7 C12N 15/12, C07K 14/47, C12N 15/62, C07K 16/18, A61K 48/00 
According to International Patent Classincation (IPC) or to both national classincalion and IPC 

a FIELDS SEARCHED - 

Minimum documentaiion searched (classincation system followed by classification symbols) 
IPC7CI2N,C07K 

Documentation searched other than minimum documentaiion to the extent that such documents arc included in the fields searched 




emational application No. 
PCT/KR2003/002I6I 



Electronic data base consulted during the intertnational search (name of data base and, where practicable, search terms used) 
NCBl Gcnbank, EMBL, Swissprol. Delphion, Pubmed "IQGAP* 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document, with indication, where appropriate, of the relevant passages 


Relevant to claim No. 


A 


Database NCBl Genbank [on line] 
Accession No. AL365 181, 22 Jan. 2002 

Human DNA sequence from clone RPn-284F21 on chromosome 1, complete sequence 


1-9 


A 


Jon W. Erickson et al. " Identification of an actin cytoskeletal complex that includes IQGAP and 
the Cdc42 GTPasc" J. Biol.Chcm. VbL272 No. 26. pp24443-24447, 26 Sep.!997 


1-33 


A 


Shinya Kuroda et al. " Identification of IQGAP as a putative target for the small GTPase, Cdc42 
and Racr J. Biol. Chem. Vol.271 No.38 pp23 363 -23367, 20 Sep. 1996 


1-33 


A 


Michael W. Briggs et al. "IQGAP 1 -mediated stimulation of transcscriptional co-activation by bcta- 
catenin is modulated by calmodulin", J. Biol. Chem. Vol.277 No.9 pp7453-7463, 1 Mar. 2002 
Cited in the application 


1-33 



I I Further documents are listed in the continuation of Box C. 



|~| See patcm family annex. 



Special categories of cited documents: 

document defining the general state of the art which is ncA considered 
to be of paiticular relevance 

earlier application or patent but published on or after the imemational 
filing date 

document which may throw doubts on priority ciaim(s) or which is 
cited to establish the publication date of citation or other 
special reason (as specified) 

document rcfemng to an oral disclosure, use, exhibition or other 



"A* 

"L" 
"O- 



"P" document published prior to the international filing date but later 
than the priority date claimed 



later document published after the international filing date or priority 
date and not in conflict with the application but cited to undbrstand 
the principle or theory underlying the invention 
document of particular relevance; the claimed Invention cannot be 
considered novel or cannot be considered to involve an inventive 
step when the doeument is taken done 

document of particular relevance; the claimed invention cannot be 
considered to inM>lve an inventive step when the document is 
combined with one or more other such docummts.such combination 
being obvnus to a person skilled in the art 
document member of the same patent family 



Date of the actual completion of the international search 
28 JANUARY 2004 (28.01.2004) 


Dale of mailing of the international search report 
28 JANUARY 2004 (28.01.2004) 


Name and mailing address of the ISA/KR 

^^^^ Korean Intellectual Property OfTicc 
■ ^ Dunsan-dong. Seo-gu, Daejeon 302-701, 
Republic of Korea 

Facsimile No, 82-42-472-7 1 40 


Authorized officer ^ — 

CHO, Myung Sun I ^ \M j 
Telephone No. 82-42-48 1 -5594 ^^SC^^ 



Form PCT/lSA/210 (second sheet) (January 2004) 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/KR2003/00216I 



This international Searching Author ity found multiple inventions in this international application 
as fol lows: 

1. Claims 1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ ID No.t and No. 3. and 
isolated nucleic acid molecule encoding SEQ 10 No. 2 and No. 4. 

2. Clains 1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ ID No. 5. and isolated 
nucleic acid molecule encoding SEQ ID No. 6. 

3. Claims 1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ 10 No. 7. and isolated 
nucleic acid molecule encoding SEQ ID No. 8. 

4. Claims 1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ ID No. 9. and isolated 
nucleic acid molecule encoding SEQ ID No. 10. 

5. Claims 1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ ID No. 11. and isolated 
nucleic acid molecule encoding SEQ 10 No12. 

6. Claims 1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ 10 No. 13 and No. 15, and 
isolated nucleic acid molecule encoding SEQ ID No. 14 and No. 16. 



Form PCT/ISA/210 (extra sheet) (Januaiy 2004) 



INTERNATIONAL SEARCH REPORT 


International application No. 




PCT/KR2003A)02I61 



Box No. I Nucleotide and/or amino acid sequence(s) (ContinuatioD of itcmlA of the first sheet) 



1. With regard to any nucleotide and/or amino acid sequence disclosed in the international application and necessary to the claimed 
invention, the international search was carried out on the basis of: 

a. type of material 

pC I a sequence listing 

[~[ table(s) related to the sequence listing 

b. format of material 

[ I in written fomiat 

in computer readable form 

c. time of filing/fumishing 

{ I contained in the interrational application as filed 

filed together with the international application in computer readable fonn 
I I furnished subsequently to this Authority for the purposes of search 

2. Q In addition, in tfie case tiiat more tiian one version or copy of a sequence listing and/or table relating thereto has been filed 

• or furnished, the required statements that the information in the subsequent or additional copies is identical to that in the 
application as filed or does not go beyond the application as filed, as appropriate, were furnished. 

3. Additional comments: 



Form PCT/ISA/210 (continuation of first sheet (I)) (January 2004) 



INTERNATIONAL SEARCH REPORT 


International application No. 




PCT/KR2003/002161 



Box No. II Observations where certain claims were found unsearchable (Continuation of Item 2 of first sheet) 



This international search report has not been established in respect of certain claims under Article 1 7(2)(a) for the following reasons: 

1. rn Claims Nos.: 

— because they relate to subject nutter not required to be searched by this Authority, namely: 

2. I I Claims Nos.: 

* — ' because they relate to parts of the international application that do not comply with the prescribed requirements to such an 
extent that no meaningful international search can be carried out, specifically: 

3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box No. Ill Observations where unit y of invention Is lacking (Continuation of item 3 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

See separate sheet 



1 . As all required addtional search fees were timely paid by the applicant, this international search report covers all sean;hable 

claims. 

2. Q As all searchable claims could be searched without effort justiiying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. rn As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nos.: 



4. ^T] No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 
1-33 (in part) 

Inventions related to an isolated nucleic acid molecule comprising SEQ ID No. I and No.3, and isolated nucleic acid molecule 
encoding SEQ IDNo.2 and No.4. 

Remark on Protest The additional search fees were accompanied by the applicant's protest. 

I I No protest accompanied the payment of additional search fees. 



Fonm PCT/ISA/2 10 (continuation of first sheet (2)) (Januao* 2004) 



